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(54) INFORMATION PROCESSOR, PORTABLE DEVICE, ELECTRONIC PET DEVICE, RECORDED 
MEDIUM ON WHICH INFORMATION PROCESSING PROCEDURE IS RECORDED, AND 
INFORMATION PROCESSING METHOD 

(57) In an infomnation processing apparatus, a port- 
able device, an electronic pet apparatus, recording 
medium storing information processing procedures and 
an infonnation processing method, various kinds of data 
is transmitted via a network, and in addition, words can 
be catalogued via voice. Further, various responses are 
made in accordance with user authentication, voice 
inputs and responses are classified into categories 
which are used as a basis for generating a response. 
Furthermore, the emotion of the electronic pet can be 
changed on the basis of a past history. 
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Description 

Technical Field 

[0001] The present invention relates to an infomna- 
tion processing apparatus, a portable device, an elec- 
tronic pet apparatus, recording medium storing 
information processing procedures and an infomiation 
processing method, and can be applied to various kinds 
of information device such as mobile telephones and 
personal computers. By exchanging various kinds of 
data required in generation of a response via a network 
and by using voice to catalogue words, the present 
invention realizes a more familiar electronic pet appara- 
tus, an information processing apparatus with an elec- 
tronic pet, a portable device and a recording medium 
storing information processing procedures. 

Background Art 

[0002] For conventional personal computers, so- 
called rearing simulation game software has been pro- 
posed. The rearing simulation game software is a game 
to rear a pet (that is, an electronic pet) in a virtual reality 
space provided by a computer. The pet rearing simula- 
tion game software allows easy communications with 
an electronic pet in comparison with really rearing a pet. 
[0003] By the way, a real pet performs various kinds 
of action depending to the physical condition thereof, 
the surrounding environment and so on. In addition, the 
pet recognizes the owner and performs actions different 
from actions to others. Moreover, the behavior may be 
changed by learning. 

[0004] If an electronic pet is capable of Imitating a 
variety of behaviors of a real pet, the electronic pet can 
be considered to be more familiar. 

Disclosure of Invention 

[0005] It is an object of the present invention 
addressing the problems described above to provide a 
more familiar electronic pet apparatus, an infomnation 
processing apparatus with an electronic pet, a portable 
device, a recording medium storing information 
processing procedures and an information processing 
method. 

[0006] In order to solve the problems described 
above, the present invention is applied to an information 
processing apparatus, a portable device or an elec- 
tronic pet apparatus, and relating to: a voice recognition 
means for outputting a result of voice recognition in con- 
formity with a predetermined recognition rule; an emo- 
tion generation means for generating an emotion 
parameter, which varies at least in accordance with the 
result of voice recognition and the lapse of time and 
indicates an emotion in a pseudo manner, in conformity 
with a predetennined emotion-parameter generation 
rule; and a response generation means for generating a 



response to the result of voice recognition in confonmity 
with a predetermined response generation rule based 
on at least the emotion parameter, the following means 
is included: a communication means for carrying out 
5 processing to update the recognition rule, the emotion- 
parameter generation rule and the response generation 
rule by connection to a predetermined network; or a 
communication means for can7ing out processing to 
update data required in the recognition rule, the emo- 
10 tion-parameter generation rule and the response gener- 
ation rule by connection to the predetermined network. 
[0007] In addition, the present invention is applied 
to an information processing apparatus, a portable 
device or an electronic pet apparatus, and also includes 
15 a communication means for acquiring at least the emo- 
tion parameter or data required in generation of the 
emotion parameter by connection to a predetennined 
network wherein the response generation means gen- 
erates a response depending on the emotion parameter 
20 acquired by the communication means or a response 
depending on an emotion parameter generated from 
the data acquired by the communication means. 
[0008] Furthermore, the present invention also pro- 
vides a recording medium storing information process- 
es ing procedures including: communication processing to 
execute a process to update the recognition rule, the 
emotion-parameter generation rule or the response 
generation rule by connection to a predetermined net- 
work; or communication processing to execute a proc- 
30 ess to update data required for the recognition rule, the 
emotion-parameter generation rule or the response 
generation rule by connection to the predetermined net- 
work. 

[0009] Moreover, the present invention also pro- 

35 vides a recording medium storing infomriation process- 
ing procedures including: communication processing to 
acquire at least the emotion parameter or data required 
in generation of the emotion parameter by connection to 
a predetermined network from equipment of the same 

40 type connected to the network; and response genera- 
tion processing to generate a response depending on 
the emotion parameter acquired by the communication 
processing or a response depending on an emotion 
parameter generated from the data acquired by the 

45 communication processing. 

[0010] Further, the present invention is applied to 
an information processing method and comprises: com- 
munication processing to execute a process to update 
the recognition rule, the emotion-parameter generation 

50 rule or the response generation rule by connection to a 
predetennined network; or communication processing 
to execute of a process to update data required for the 
recognition rule, the emotion-parameter generation rule 
or the response generation rule by connection to the 

55 predetermined network. 

[0011] In addition, the present Invention is applied 
to an information processing method and comprises: 
communication processing to acquire at least the emo- 
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tion parameter or data required in generation of the 
emotion parameter by connection to a predetemriined 
network; and response generation processing to output 
a response depending on the emotion parameter 
acquired by the communication processing or a 
response depending on an emotion parameter gener- 
ated from the data acquired by the communication 
processing. 

[0012] Moreover, the present invention is applied to 
an information processing apparatus, a portable device 
or an electronic pet apparatus, includes a cataloging 
means capable of changing a data base via voice, on 
the basis of a result of voice recognition by at least cat- 
aloging a word obtained as a result of voice recognition 
in the data base in a cataloging operation mode. 
[0013] Furthemnore, the present invention also pro- 
vides a recording medium storing information process- 
ing procedures Including cataloging processing capable 
of changing a data base via voice the basis of a result of 
voice recognition by at least cataloging a word obtained 
as a result of voice recognition in the data base in a cat- 
aloging operation mode. 

[0014] Furthermore, the present invention is 
applied to an information processing method and com- 
prises cataloging processing capable of changing a 
data base via voice on the basis of a result of voice rec- 
ognition by at least cataloging a word obtained as a 
result of voice recognition in the data base in a cata- 
loging operation mode. 

[0015] In addition, as an application to an infomia- 
tlon processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention has a user 
authentication means for authenticating the user on the 
basis of voice wherein the response generation means 
changes a response in accordance with the user and in 
dependence on a result of authentication output by the 
user authentication means. 

[0016] Furthennore. the present invention also pro- 
vides a recording medium storing information process- 
ing procedures including user authentication processing 
of authenticating the user on the basis of voice and 
response generation processing of changing a 
response In accordance with the" user and in depend- 
ence on a result of authentication output by the user 
authentication processing. 

[0017] Furthermore, as an application to an infor- 
mation processing method, the present invention com- 
prises user authentication processing of authenticating 
the user on the basis of voice and response generation 
processing of changing a response in accordance with 
the user and in dependence on a result of authentica- 
tion output by the user authentication processing. 
[0018] In addition, as an application to an informa- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention has a 
word/phrase classification means for identifying the type 
of an input expressed by voice in generation of a 
response to a result of voice recognition wherein a 



response generation rule is set as a rule for generating 
responses excluding a response of a predetemrjined 
type in accordance with the type of the voice input and 
on the basis of classification of responses according to 

5 classification of voice inputs. 

[0019] Furthermore, the present invention also pro- 
vides a recording medium storing, on the basis of the 
voice input, an infonnation processing procedure pre- 
scribing word/phrase classification processing to iden- 

10 tify the type of an input expressed by a voice in 
generation of a response to a result of voice recognition 
processing to set a response generation rule as a rule 
for generating responses excluding a response of a pre- 
detennined type in accordance with the type of the 

15 voice input and on the basis of classification of 
responses according to classification of voice inputs. 
[0020] Further, as an application to an information 
processing method, the present invention comprises 
information processing procedure for recognizing the 

20 type of voice input and generating a response to the 
result of voice recognition in accordance with the prede- 
termined response generation rule which is a rule of 
generating- responses excluding a response of a prede- 
termined type in accordance with the type of an input 

25 and a category of a response to the input. 

[0021] In addition, as an application to an infomna- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention has a history 
recording means for recording a history of at least 

30 results of voice recognition and emotion parameters 
corresponding to results of voice recognition wherein a 
change in emotion parameter corresponding to a result 
of voice recognition is varied in accordance with the his- 
tory 

35 [0022] Furthermore, the present invention also pro- 
vides a recording medium storing infonnation process- 
ing procedures prescribing history recording processing 
to record a history of at least results of voice recognition 
and emotion parameters corresponding to results of 

40 voice recognition to vary a change in emotion parame- 
ter corresponding to a result of voice recognition in 
accordance with the history. 

[0023] On the top of that, as an application to an 
information processing method, the present invention 

45 comprises history recording processing to record a his- 
tory of at least results of voice recognition and emotion 
parameters corresponding to results of voice recogni- 
tion to vary a change in emotion parameter con^espond- 
ing to a result of voice recognition in accordance with 

50 the history. 

[0024] In addition, as an application to an infomna- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention relates to: a 
voice recognition means for processing a voice and out- 

55 putting a result of voice recognition in conformity with a 
predetermined recognition rule; an emotion generation 
means for generating an emotion parameter, which indi- 
cates an emotion in a pseudo manner as well as varies 
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at least in accordance with the result of voice recogni- 
tion and varies with the lapse of time, in confornriity with 
a predetermined emotion-parameter generation rule; 
and a response generation means, for generating a 
response to a result of voice recognition in conformity 
with a predetermined response generation rule based 
on at least the emotion parameter, wherein there is 
included: a communication means' for carrying out 
processing to update the recognition rule, the emotion- 
parameter generation rule and the response generation 
rule by connection to a predetennined network; or a 
communication means for carrying out processing to 
update data required in the recognition rule, the emo- 
tion-parameter generation rule and the response gener- 
ation rule by connection to the predetermined network. 
[0025] Accordingly, the communication means is 
capable of outputting various kinds of data required in 
the generation of a response. Thus, equipment of the 
same type connected to the network is capable of gen- 
erating almost the same response as a response to a 
voice input in this information processing apparatus, the 
portable device or the electronic pet apparatus. As a 
result, an electronic pet can be treated as if the elec- 
tronic pet were taken out to the external equipment con- 
nected to the network and, moreover, the electronic pet 
can also be made easy to get acquainted with as if the 
electronic pet were a real pet in the course of actual 
training. 

[0026] In addition, as an application to an informa- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention includes a 
communication means for acquiring at least an emotion 
parameter or data required in generation of an emotion 
parameter by connection to a predetermined network 
wherein the response generation means generates a 
response depending on the emotion parameter 
acquired by the communication means or a response 
depending on an emotion parameter generated from 
the data acquired by the communication means. Thus, 
the response generation means is capable of generat- 
ing almost the same response as a response to a voice 
input in equipment of the same type connected to the 
network. As a result, an electronic pet can be treated as 
if the electronic pet were taken out from the equipment 
of the same type connected to the network and, moreo- 
ver, the electronic pet can be made easy to get 
acquainted with as if the electronic pet were a real pet in 
the course of actual training. In addition, the amount of 
knowledge can be enlarged if necessary typically by 
making the vocabulary of words that can be understood 
by the electronic pet larger. 

[0027] Furthermore, the present invention also pro- 
vides a recording medium storing information process- 
ing procedures prescribing: communication processing 
to execute a process to update the recognition rule, the 
emotion-parameter generation rule or the response 
generation rule by connection to a predetennined net- 
work; or communication processing to execute a proc- 



ess to update data required for the recognition rule, the 
emotion-parameter generation rule or the response 
generation rule by connection to the predetermined net- 
work. 

5 [0028] Thus, equipment of the same type con- 
nected to the network is capable of generating almost 
the same response as a response to voice input in an 
apparatus executing the information processing proce- 
dure stored in this recording medium. As a result, an 

10 electronic pet can be treated as if the electronic pet 
were taken out to the external equipment and, further- 
more, the electronic pet can be made easy to get 
acquainted with as if the electronic pet were a real pet in 
the course of actual training. 

15 [0029] Moreover, the present invention also pro- 
vides a recording medium storing information process- 
ing procedures prescribing: communication processing 
to acquire at least an emotion parameter or data 
required in generation of an emotion parameter by con- 

20 nection to a predetermined network; and response gen- 
eration processing to generate a response depending 
on the emotion parameter acquired by the communica- 
tion processing or a response depending on an emotion 
parameter generated from the data acquired by the 

25 communication processing. 

[0030] Thus, an apparatus executing the infonna- 
tion processing procedure stored In this recording 
medium is capable of generating almost the same 
response as a response to a voice input in the equip- 

30 ment of the same type connected to the network. As a 
result, an electronic pet can be treated as if the elec- 
tronic pet were taken out from the equipment of the 
same type connected to the network and, moreover, the 
electronic pet can be made easy to get acquainted with 

35 as if the electronic pet were a real pet in the course of 
actual training. In addition, the amount of knowledge 
can be enlarged if necessary typically by making the 
vocabulary of words that can be understood by the elec- 
tronic pet larger. 

40 [0031] On the top of that, as an application to an 
Information processing method, the present invention 
comprises: communication processing to execute a 
process to update the recognition rule, the emotion- 
parameter generation rule or the response generation 

45 rule by connection to a predetermined network; or com- 
munication processing to execute a process to update 
data required for the recognition rule, the emotion- 
parameter generation rule or the response generation 
rule by connection to a predetermined network, 

50 [0032] Thus, equipment of the same type con- 
nected to the network is capable of generating almost 
the same response as a response to a voice input in an 
apparatus executing the information processing 
method. As a result, an electronic pet can be treated as 

55 if the electronic pet were taken out to the external equip- 
ment and, furthermore, the electronic pet can be made 
easy to get acquainted with as if the electronic pet were 
a real pet in the course of actual training. 
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[0033] In addition, as an application to an informa- 
tion processing method, the present invention com- 
prises: communication processing to acquire at least 
the emotion parameter or data required in generation of 
the emotion parameter by connection to a predeter- 
mined network; and response generation processing to 
generate a response depending on the emotion param- 
eter acquired by the communication processing or a 
response depending on an emotion parameter gener- 
ated from the data acquired by the communication 
processing. 

[0034] Thus, an apparatus executing this infomna- 
tion processing method is capable of generating almost 
the same response as a response to a voice Input in the 
equipment of the same type connected to the networl<. 
As a result, an electronic pet can be treated as If the 
electronic pet were tal<en out from the equipment con- 
nected to the network and, moreover, the electronic pet 
can be made easy to get acquainted with as if the elec- 
tronic pet were a real pet in the course of actual training. 
In addition, the amount of knowledge can be enlarged if 
necessary typically by making the vocabulary of words 
that can be understood by the electronic pet larger. 
[0035] Moreover, as an application to an infomia- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention has a 
cataloging means capable of changing a data base in 
accordance with a voice input in a cataloging operation 
mode based on a result of voice recognition by at least 
cataloging a word obtained as a result of voice recogni- 
tion in the data base. Thus, the vocabulary of words that 
can be understood by an electronic pet can be made 
larger with ease by voice inputs. As a result, the elec- 
tronic pet can be made easy to get acquainted with as if 
the electronic pet were a real pet In the course of actual 
training. 

[0036] Furthennore, the present invention also pro- 
vides a recording medium storing information process- 
ing procedures prescribing cataloging processing 
capable of changing a data base in accordance with a 
voice input in a cataloging operation mode based on a 
result of voice recognition by at least cataloging a word 
obtained as a result of voice recognition in the data 
base. 

[0037] Thus, the vocabulary of words that can be 
understood by an electronic pet can be made larger with 
ease by voice inputs In an apparatus executing the infor- 
mation processing procedure stored in this recording 
medium. As a result, the electronic pet can be made 
easy to get acquainted with as if the electronic pet were 
a real pet in the course of actual training. 
[0038] On the top of that, as an application to an 
information processing method, the present invention 
comprises cataloging processing capable of changing a 
data base in accordance with a voice input in a cata- 
loging operation mode based on a result of voice recog- 
nition by at least cataloging a word obtained as a result 
of voice recognition In the data base. By executing this 



infomiation processing method, the vocabulary of 
words that can be understood by an electronic pet can 
thus be made larger with ease by voice inputs. As a 
result, the electronic pet can be made easy to get 
5 acquainted with as if the electronic pet were a real pet in 
the course of actual training. 

[0039] In addition, as an application to an infomna- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention has a user 

10 authentication means for authenticating the user 
wherein the response generation means changes a 
generated response in accordance with the user and in 
dependence on a result of authentication output by the 
user authentication means. Thus, the response of an 

15 electronic pet to the owner can be made different for 
example from that to a person other than the owner. As 
a result, the electronic pet can be made a pet which is 
easier to get acquainted with and behaves as if the elec- 
tronic pet were a real pet. 

20 [0040] Furthermore, the present invention also pro- 
vides a recording medium storing Infomriatlon process- 
ing procedures prescribing user authentication 
processing of authenticating the user and response 
generation processing of changing a generated 

25 response In accordance with the user and in depend- 
ence on a result of authentication output by the user 
authentication processing. Thus, the response of an 
electronic pet to the owner can be made different for 
example from that to a person other than the owner As 

30 a result, the electronic pet can be made a pet which is 
easier to get acquainted with and behaves as if the elec- 
tronic pet were a real pet. 

[0041] On the top of that, as an application to an 
infonnation processing method, the present invention 

35 comprises user authentication processing of authenti- 
cating the user and response generation processing of 
changing a generated response in accordance with the 
user and in dependence on a result of authentication 
output by the user authentication processing. Thus, the 

40 response of an electronic pet to the owner can be made 
different for example from that to a person other than 
the owner. As a result, the electronic pet can be made a 
pet which is easier to get acquainted with and behaves 
as if the electronic pet were a real pet. 

45 [0042] In addition, as an application to an informa- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention has a 
word/phrase classification means for Identifying the type 
of an input expressed by a voice in generation of a 

50 response to a result of voice recognition wherein a 
response generation rule is set as a rule for generating 
responses excluding a response of a predetermined 
type In accordance with the type of the voice input and 
on the basis of classification of responses according to 

55 classification of voice Inputs. It is thus possible to pre- 
vent an electronic pet from outputting an unnatural 
response such as a question raised in response to an 
inquiry. As a result, the response of the electronic pet 
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can be made natural as well as lively. In addition, the 
electronic pet can be made easier to get acquainted 
with. 

[0043] Furthermore, the present invention also pro- 
vides a recording medium storing information process- 
ing procedures prescribing word/phrase classification 
processing to identify the type of an Input expressed by 
a voice in generation of a response to a result of voice 
recognition processing to set a response generation 
rule as a rule for generating responses excluding a 
response of a predetemiined type In accordance with 
the type of the voice input and on the basis of classifica- 
tion of responses according to classification of voice 
inputs. It is thus possible to prevent an electronic pet 
from outputting an unnatural response such as a ques- 
tion raised In response to an Inquiry. As a result, the 
response of the electronic pet can be made natural as 
well as lively In addition, the electronic pet can be made 
easier to get acquainted with. 

[0044] On the top of that, as an application to an 
information processing method, the present invention 
comprises Information processing procedure for recog- 
nizing the type of voice input and generating a response 
to the result of voice recognition in accordance with the 
predetermined response generation rule which is a rule 
of generating responses excluding a response of a pre- 
determined type in accordance with the type of an input 
and a category of a response to the input. It is thus pos- 
sible to prevent an electronic pet from outputting an 
unnatural response such as a question raised in 
response to an inquiry. As a result, the response of the 
electronic pet can be made natural as well as lively In 
addition, the electronic pet can be made easier to get 
acquainted with. 

[0045] In addition, as an application to an infomna- 
tion processing apparatus, a portable device or an elec- 
tronic pet apparatus, the present invention has a history 
recording means for recording a history of at least 
results of voice recognition and emotion parameters 
corresponding to results of voice recognition wherein a 
change in emotion parameter corresponding to a result 
of voice recognition is varied in accordance with the his- 
tory. It is thus possible to create an electronic pet's 
response full of emotions of familiarity, intimacy and the 
like to for example a voice heard frequently. As a result, 
the response of the electronic pet can be made natural 
as well as lively In addition, the electronic pet can be 
made easier to get acquainted with. 
[0046] Furthermore, the present invention also pro- 
vides a recording medium storing information process- 
ing procedures prescribing history recording processing 
to record a history of at least results of voice recognition 
and emotion parameters corresponding to results of 
voice recognition to vary a change In emotion parame- 
ter con-esponding to a result of voice recognition in 
accordance with the history. It is thus possible to create 
an electronic pet's response full of emotions of familiar- 
ity, intimacy and the like to for example a voice heard 



frequently As a result, the response of the electronic 
pet can be made natural as well as lively In addition, the 
electronic pet can be made easier to get acquainted 
with. 

5 [0047] On the top of that, as an application to an 
infomiation processing method, the present invention 
comprises history recording processing to record a his- 
tory of at least results of voice recognition and emotion 
parameters corresponding to results of voice recogni- 

10 tion to vary a change in emotion parameter con'espond- 
ing to a result of voice recognition in accordance with 
the history. It is thus possible to create an electronic 
pet's response full of emotions of familiarity, intimacy 
and the like to for example a voice heard frequently. As 

15 a result, the response of the electronic pet can be made 
natural as well as lively. In addition, the electronic pet 
can be made more familiar. 
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Brief Description of Drawings 
[0048] 

Rg. 1 is a functional block diagram showing an 
electronic pet apparatus as implemented by an 
embodiment of the present invention. 
Fig. 2 is a diagram showing a front view of the elec- 
tronic pet apparatus shown in Fig. 1 . 
Fig. 3 is a hardware block diagram showing the 
electronic pet apparatus of Fig. 1 . 
Fig. 4 shows a table of data representing the physi- 
cal condition. 

Fig. 5 shows a table of data representing an emo- 
tion. 

Fig. 6 shows a table of character data. 

Fig. 7 shows a table of data representing a changed 

emotion. 

Fig. 8 shows rules described in pattern data. 

Fig. 9 shows a table of files each containing voice 

data. 

Fig. 10 shows a table of files each containing pic- 
ture data. 

Fig. 1 1 shows a flowchart representing a connec- 
tion processing procedure for connecting the elec- 
tronic pet apparatus to a network. 
Fig. 1 2 is a diagram showing the format of data out- 
put to the network. 

Fig. 13 is a functional block diagram showing the 
electronic pet apparatus in more detail in an opera- 
tion to catalog recognition data. 
Fig. 1 4 is a diagram showing syntax of a voice input 
subjected to a voice recognition process. 
Fig. 15 shows a flowchart representing a process- 
ing procedure for cataloging recognition data. 
Rg. 16 is a functional block diagram showing the 
electronic pet apparatus in an operation to authen- 
ticate the user in more detail. 
Fig. 17 shows rules of pattern data. 
Fig. 1 8 shows a typical dialog to know a favorite of 
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the user to be used later in authentication of the 
user. 

Fig. 19 shows a typical dialog to authenticate the 
user by using the favorite obtained during the con- 
versation shown in Fig. 18. 
Fig. 20 is a functional blocl( diagram showing the 
electronic pet apparatus in processing to classify a 
word or a phrase in more detail. 
Fig. 21 shows rules of pattern data for creation of 
responses to a variety of categories each obtained 
as a result of the processing to classify a word or a 
phrase shown in Fig. 20. 
Fig. 22 shows a typical conversation history. 
Fig. 23 shows another typical conversation history. 
Fig. 24 is a functional block diagram showing the 
electronic pet apparatus In execution of emotion 
control in more detail. 

Fig. 25 is a table of variations in emotions (charac- 
ter data) for different keywords each included in a 
user voice input. 

Fig. 26 is a table summarizing the variations in 
emotions (character data) shown in Fig. 25. 
Fig. 27 is a table summarizing changed variations 
in emotions (character data). 
Fig. 28 is a hardware block diagram showing an 
electronic pet apparatus as implemented by 
another embodiment of the present invention. 
Fig. 29 is a diagram showing a front view of a port- 
able telephone. 

Best Mode for Can7ing Out the Invention 

1 . First Embodiment 

1-1. Overall Configuration of the First Embodiment 

[0049] Fig. 2 is a diagram showing a front view of an 
electronic pet apparatus 1 implemented by a first 
embodiment of the present invention. As shown in the 
figure, the electronic pet apparatus 1 includes an 
antenna 2 which can be pulled out upward and a liquid- 
crystal display panel 3 on the upper portion of the front 
surface. The liquid-crystal display panel 3 employed in 
the electronic pet apparatus 1 displays the figure of an 
electronic pet and a message issued by the electronic 
pet. Under the liquid-crystal display panel 3, the elec- 
tronic pet apparatus 1 includes a confimn operator 4A, a 
cancel operator 4B and a cursor operator 5. These 
operators are operated to change the operating mode 
and to accomplish other purposes. 
[0050] The electronic pet apparatus 1 further 
includes a speaker 6 and a microphone 7 beneath the 
confirm and cancel operators 4A and 4B respectively. A 
conversation can be held with the electronic pet through 
the speaker 6 and the microphone 7. Furthenmore, the 
electronic pet apparatus 1 has a socket on the rear sur- 
face. The socket allows an IC card 8 to be mounted on 
the electronic pet apparatus 1. 



[0051] Fig. 3 is a block diagram showing hardware 
of the electronic pet apparatus 1 . As shown in the figure, 
the electronic pet apparatus 1 includes an analog-to- 
digital (A/D) conversion circuit 10 for converting an 

5 audio analog signal coming from the microphone 7 by 
way of an amplifier circuit not shown In the figure into 
digital audio data DA. The analog-to-digital conversion 
circuit 10 outputs the digital audio data DA to a central 
processing unit (CPU) 1 1 . In this way, the electronic pet 

10 apparatus 1 is capable of processing a voice entered by 
the user by using the central processing unit 1 1 . 
[0052] On the other hand, a digital-to-analog (D/A) 
conversion circuit 12 converts digital audio data DB pro- 
duced by the central processing unit 1 1 into an analog 

15 audio signal which is output to the speaker 6. In this 
way, the user is capable of verifying a voice of the elec- 
tronic pet generated by the electronic pet apparatus 1 to 
express a response generated by the electronic pet. 
[0053] Controlled by the central processing unit 1 1 , 

20 a monitor interface (monitor 1/F) 13 drives the liquid- 
crystal display panel 3 to display a picture of the elec- 
tronic pet on the liquid-crystal display panel 3 in accord- 
ance with picture data OV coming from the central 
processing unit 1 1 by way of a bus. 

25 [0054] A key interface (key l/F) 14 detects an oper- 
ation carried out by the user on the operator 4A, 4B or 
5, supplying a detection signal to the central processing 
unit 1 1 . A read-only memory (ROM) 15 is used for stor- 
ing Infomnation such as a processing program to be exe- 

30 cuted by the central processing unit 1 1 and various 
kinds of data necessary for an analysis of a voice 
acquired through the microphone 7. The central 
processing unit 1 1 reads out information from the read- 
only memory 15 to be output also under control exe- 

35 cuted by the central processing unit 11. A random- 
access memory (RAM) 1 6 serves as a work area of the 
central processing unit 1 1 . The random-access memory 
16 is used for temporarily storing various kinds of data 
necessary for processing carried out by the central 

40 processing unit 1 1 . 

[0055] Controlled by the central processing unit 1 1 , 
a network connection unit 17 connects the electronic 
pet apparatus 1 to a predetermined network 1 8 through 
a telephone line. The electronic pet apparatus 1 

45 exchanges various kinds of data DT with the network 1 8 
and, when necessary, updates information such as con- 
tents of the random-access memory 16 by using the 
exchanged data. To put it in detail, the electronic pet 
apparatus 1 is thus capable of acquiring various kinds of 

50 data required for training and nurturing the electronic 
pet from the network 18 when necessary. In addition, 
data stored in the random-access memory 1 6 may be 
transmitted to a desired temiinal by way of the network 
1 8. As a result, the electronic pet can be treated as if the 

55 pet were taken out to a variety of environments by 
exporting data to terminals connected to the network 
18. On the contrary, an electronic pet of another appa- 
ratus connected to the terminal 18 can be trained by 
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using this electronic pet apparatus. 
[0056] The IC card 8 is an external recording device 
that can be mounted and dismounted. If necessary, 
data stored in the IC card Is used for updating informa- 
tion such as the contents of the random-access mem- 
ory 1 6, or data stored in the random-access memory 1 6 
can be transferred to the IC card 8. In this way, the elec- 
tronic pet apparatus 1 is capable of exchanging various 
kinds of data with other equipment through the IC card 
8. making it possible to acquire various kinds of data 
necessary for training and nurturing of the electronic 
pet. In addition, the electronic pet can be treated as if 
the pet were taken out to a variety of environments and, 
on the contrary, an electronic pet of another apparatus 
can be trained by using this electronic pet apparatus 1 . 
[0057] Rg. 1 is a block diagram showing a basic 
configuration of the electronic pet apparatus 1 in terms 
of functional blocks. It should be noted that rectangular 
functional blocks shown in Fig. 1 each represent a 
processing program stored in the read-only memory 15 
to be executed by the central processing unit 1 1 . On the 
other hand, a functional block drawn as a symbol of a 
magnetic disc represents data stored in the read-only 
memory 1 5, the random-access memory 1 6 or the IC 
card 8. 

[0058] A voice recognition module 1 1 A employed in 
the electronic pet apparatus 1 carries out a voice recog- 
nition processing on audio data DA in conformity with a 
predetermined recognition rule, generating a result of 
voice recognition as an output. To put it In detail, the 
voice recognition module 11A delimits voice repre- 
sented by the sequentially received audio data DA by 
phonemes in accordance with a HMM (Hidden Marcov 
Model) method. The voice recognition module 11A ref- 
erences recognition data 16A for a series of such pho- 
nemes. The voice recognition module 11A produces 
words of the audio data DA, words of a phrase catal- 
oged in advance and, in the case of a phrase, words of 
the phrase or text data representing the phrase on the 
basis of results of the reference to the recognition data 
16A as results of recognition. The recognition data 16A 
is a data base associating text data of words and 
phrases with a series of phonemes output by the HMM 
method. That is to say, the recognition data 16A is a 
data base used for storing pairs each comprising text 
data and a phoneme. Such a data base allows the elec- 
tronic pet apparatus 1 to convert a voice of "A Good kid- 
said by the user in front of the microphone 7 into an 
array of characters representing a text of "A Good kid." 
As a result, a voice input is converted Into an array of 
characters. 

[0059] A timer 1 1 B invokes components such as a 
physical-condition changing module 11C and an emo- 
tion changing module 11 D at predetermined intervals. 
[0060] When activated by the timer 11 B, the physi- 
cal-condition changing module 11C updates physical- 
condition data 16B in accordance with a result of voice 
recognition. The physical-condition data 16B includes 



parameters representing the present physical condition 
of the electronic pet It should be noted that, in the case 
of this embodiment, the physical -condition data 16B 
comprises 5 parameters called "fatigue", "hunger", 

5 "thirstiness", "sickness" and "sleepiness" respectively 
as shown in Fig. 4. The larger the value of a parameter, 
the greater the share of the parameter in the physical 
condition of the electronic pet. The typical values shown 
in Fig. 4 thus indicate that, at the present time, the elec- 

10 tronic pet is extremely tired and very hungry. 

[0061] As described above, the physical-condition 
changing module 110 updates the physical-condition 
data 16B in accordance with a result of voice recogni- 
tion as activated by the timer 1 1 B. For example, the 

15 "hunger", "thirstiness" and "sleepiness" parameters are 
increased gradually in conformity with the rule of nature 
as is generally seen in the course of typical nurturing of 
a real pet. As a result, the electronic pet gets hungry 
with the lapse of time. Another example of an operation 

20 to update the physical-condition data 16B in accord- 
ance with a result of voice recognition is an operation to 
decrease the "hunger" parameter when a result of voice 
recognition indicates that food has been given to the 
electronic pet. Still another example of an operation to 

25 update the physical-condition data 1 6B in accordance 
with a result of voice recognition is an operation to 
decrease the "thirstiness" parameter when a result of 
voice recognition indicates that a drink has been given 
to the electronic pet. A further example of an operation 

30 to update the physical-condition data 16B in accord- 
ance with a result of voice recognition is an operation to 
gradually increase the "fatigue" parameter when a result 
of voice recognition indicates that owner is playing with 
the electronic pet. A still further example of an operation 

35 to update the physical-condition data 16B in accord- 
ance with a result of voice recognition is an operation to 
gradually decrease the "sleepiness" parameter syn- 
chronously with a timer when a result of voice recogni- 
tion indicates that the owner tells the electronic pet to 

40 sleep. 

[0062] On the other hand, the emotion changing 
module 1 1D updates the present emotion data 16C in 

accordance with a result of voice recognition as acti- 
vated by the timer 1 1B. The present emotion data 160 

45 includes variables representing emotions of the current 
electronic pet in a pseudo manner. Such variables are 
each also refen^ed to as a pseudo emotion parameter. It 
should be noted that, in the case of this embodiment, 
there are 6 pseudo emotion parameters which repre- 

50 sent "anger", "sadness", "joy", "fear", "surprise" and 
"hatred" emotions respectively as shown in Fig. 5. The 
larger the value of a pseudo emotion parameter, the 
greater the emotion represented by the parameter. A 
typical set of values of pseudo emotion parameters 

55 shown in Fig. 5 indicate that, at the present time, the 
electronic pet is joyful but angry. 
[0063] As described above, the emotion changing 
module 1 1 D updates the emotion data 160 in conform- 
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ity with the rule of nature as is generally seen in a the 
course of typical nurturing of a real pet. That Is to say, 
when the emotion changing module 1 1 D updates the 
emotion data 16C as activated by the timer 11 B, the 6 
variables, namely, the "anger", "sadness", "joy", "fear", 5 
■surprise" and "hatred" emotion parameters, are each 
gradually updated so as to approach a predetemnined 
reference value. Thus, in the case of the example 
shown in Fig. 5, the "anger", the "sadness" and the 
other emotions are gradually settled. w 
[0064] When the emotion changing module 11 D 
updates the emotion data 16C in accordance with a 
result of voice recognition, on the other hand, character 
data 16D is searched for information indicated by the 
result of voice recognition and the information is then 15 
used as a basis for updating the emotion data 1 6C. 
[0065] As shown in Fig. 6, the character data 1 6D 
comprises changes in emotion data 16C classified by 
phrase (or word) included in a result of voice recogni- 
tion. That is to say, by using a phrase included in a result 20 
of voice recognition as a keyword, the character data 
1 6D can be searched for variations in emotion data 1 6C 
proper for the phrase (or the word). Assume for example 
that the user says: "Good" to the electronic pet. In this 
case, the "anger", "sadness", "joy", "fear", "surprise" 25 
and "hatred" emotion parameters are changed by -1, 
+2, +20, -5, +5 and -1 respectively as shown in Fig. 6. In 
other words, variations in emotion data 16C of -1, +2, 
+20, -5, +5 and -1 are assigned to the word "Good". 
[0066] Thus, when the user says: "A Good kid," for 30 
example, the emotion changing module 11D updates 
the emotion data 16C shown in Fig. 5 to that shown in 
Fig. 7. In this way, the emotion changing module 11 D 
serves as an emotion generation means which gener- 
ates pseudo emotion parameters each representing an 35 
emotion in a pseudo manner and updates the emotion 
data on the basis of a predetermined emotion-parame- 
ter generation rule at least in accordance with a result of 
voice recognition. In addition, the pseudo emotion 
parameters also vary with the lapse of time. 40 
[0067] A response-sentence creation module 1 1 E 
generates a response to a result of voice recognition in 
accordance with predetermined response generation 
rules based on the physical-condition data 16B and the 
emotion data 1 6C. Pattern data 1 6E is a set of rules for 45 
generation of such a response. As shown in Fig. 8, each 
of the rules describes a response to an input key phrase 
which includes a word obtained as a result of voice rec- 
ognition. Detemnined by a key phrase, a response 
described by a rule also varies in accordance with the so 
emotion data 16C and the physical-condition data 168. 
It should be noted that only minimum require rules are 
shown in Fig. 8 in order to make the explanation simple. 
Actual rules prescribe conditions (including attributes to 
be described later) other than the conditions shown in 55 
Fig. 8. Rule 2 shown in Fig. 8 is an example of a rule 
based on emotion data 1 6C only. It should be noted that 
a rule can be based on a combination of the emotion 



data 16C and the physical-condition data 16B. 
[0068] Rule 1 shown in Fig. 8 prescribes response 
phrases to an input phrase "I love you" or "I like you." 
According to Rule 1, if the input phrase is a voice of an 
authenticated user, a response phrase saying: "l love 
you, too" or "Wow, I am a male though" is output at ran- 
dom. If the input phrase is not a voice of an authenti- 
cated user, on the other hand, a response phrase 
saying: "A strange person" or "Who are you?" Is output 
at random. 

[0069] Rule 2 shown in Fig. 8 prescribes response 
phrases to an input phrase "Good day" or "Hello." As 
described above, the response phrases are based on 
the "anger", "sadness", "joy", "fear", "surprise" and 
"hatred" emotions of the emotion data. To be more spe- 
cific, a response phrase saying: "Shut up". "What?", 
"Howdy", "1 am surprised", "Hi", or "Did you call me?" is 
selected as an output if the largest among the "anger", 
"sadness", "Joy", "fear", "surprise" and "hatred" emotion 
parameters respectively exceeds a predetemiined 
value. 

[0070] The statement 'authenticated (A); (B)' in 
Rule 1 shown in Fig. 8 means that if a result of user 
authentication or the like to be described later is set at a 
Boolean value of TRUE", the phrase (A) is selected 
and If the result of the user authentication or the like is 
not set at "TRUE", on the other hand, the phrase (B) is 
selected. The statement "random ("A", "B")" means that 
either the phase "A" or "B" is selected at random. 
[0071] By the way. the "joy" emotion parameter in 
the typical emotion data 16C shown in Fig. 7 has the 
largest value among the variables. Thus, according to 
Rule 2, the word "Howdy" for the joy emotion is 
selected. 

[0072] As the response-sentence creation module 
1 1 E creates a response based on the emotion data 1 6C 
described above, depending on the input key phrase, 
the response-sentence creation module 1 1 E also cre- 
ates a response based on the physical-condition data 
16B or a combination of the emotion data 16C and the 
physical-condition data 1 SB as mentioned earlier. With 
such a response-sentence creation module 1 1 E, when 
the electronic pet is in an unsatisfactory physical condi- 
tion, the electronic pet apparatus 1 thus generates a 
response corresponding to the condition. 
[0073] The response-sentence creation module 
1 1 E records a generated response to such a result of 
voice recognition in a conversation history 16F. If neces- 
sary, the response-sentence creation module 1 1 E gen- 
erates a response by referring to the conversation 
history 16F In this way, an unnatural conversation 
between the electronic pet and the user can be avoided. 
In addition, the response-sentence creation module 
11E also generates a response by referring to a knowl- 
edge base 16G. As a result, the electronic pet appara- 
tus 1 is capable of changing the response in 
dependence on the user which is identified typically by 
carrying out processing to authenticate the user 
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[0074] A voice synthesis module 11 F searches 
voice data 16H for voice data DB corresponding to a 
response output by the response-sentence creation 
nnodule 11 E, outputting the voice data DB as a search 
result. As shown in Fig. 9, the voice data 1 6H is a collec- 
tion of voice files each corresponding to a response. For 
example, when the response "Howdy" is output, a voice 
file named 'voice0005.wav' is selected and voice data 
DB recorded in this voice file is output. 
[0075] A picture synthesis module 11G searches 
picture data 161 for picture data DV corresponding to a 
response output by the response-sentence creation 
module 1 1 E, outputting the picture data DV as a search 
result. As shown in Fig. 1 0, the picture data 1 61 is a col- 
lection of picture-data files each corresponding to a 
response. For example, when the response "Howdy* is 
output, a picture-data file named 'fig0005.bmp' is 
selected and picture data DV recorded in this picture- 
data file is output. 

I -2. Connection to the Network 

[0076] The central processing unit 1 1 executes a 
processing procedure shown in Fig. 1 1 to connect the 
electronic pet apparatus 1 to the network 1 8 through the 
network connection unit 17. Connected to the network 
1 8, the electronic pet apparatus 1 is capable of transmit- 
ting the physical-condition data 16B, the emotion data 
16C and the conversation history 16F to desired equip- 
ment by way of the network 18. The equipment receiv- 
ing such data from the electronic pet apparatus 1 is 
capable of reproducing the electronic pet of the elec- 
tronic pet apparatus 1 . In this way, the electronic pet can 
be taken out to a variety of environments. 
[0077] On the contrary, the centi;al processing unit 

I I is capable of acquiring physical-condition data 168, 
emotion data 16C and a conversation history 16F from 
the network 18, allowing an electronic pet raised in 
another electronic pet apparatus to be brought out to 
this electronic pet apparatus 1. In this case, the func- 
tional blocks of the electronic pet apparatus 1 are capa- 
ble of carrying out the processing based on the native 
physical-condition data 168, the native emotion data 
1 6C and the native conversation history 1 6F to emulate 
its own electronic pet raised by itself in parallel to 
processing based on the physical-condition data 168, 
the emotion data 16C and the conversation history 16F 
imported from the other electronic apparatus through 
the network 1 B to emulate another electronic pet raised 
in the other electronic apparatus to produce responses 
as if the other electronic pet were visiting this electronic 
pet apparatus 1 . It should be noted that, in the functional 
blocks shown in Fig. 1, flows of data in the processing 
based on the physical-condition data 168, the emotion 
data 16C and the conversation history 16F acquired 
from the other electronic apparatus through the network 
18 are not shown. 

[0078] The electronic pet unit 1 is also capable of 



acquiring recognition data 16A, pattern data 16E, a 
knowledge base 16G, voice data 16H and picture data 
161 from the network 18 to increase the size of the 
vocabulary of spoken words that can be recognized by 
5 the so-called electronic pet and to increase the number 
of response types. As a result, the electronic pet appa- 
ratus 1 is capable of raising and teaching the electronic 
pet. 

[0079] As shown in Fig. 1 1 , the procedure begins 
10 with a step SP1. In response to a request for connec- 
tion, the flow of the procedure goes on to a step SP2 at 

which the central processing unit 11 accepts the 
request. It should be noted that such requests for con- 
nection are generated periodically by the timer 1 1 B at 

15 fixed intervals. In addition, a request for connection can 
be made by the user by operating an operator. Further- 
more, a connection can also be established in response 
to an incoming call from the network 18. 
[0080] The flow of the procedure then goes on to a 

20 step SP3 at which the central processing unit 1 1 estab- 
lishes a communication by carrying out predetemnined 
line connection processing. Then, the flow of the proce- 
dure proceeds to a step SP4 at which the central 
processing unit 11 exchanges various kinds of data 

25 depending on the substance of the request for connec- 
tion with a communication partner. Subsequently, the 
flow of the procedure proceeds to a step SP5 at which 
the central processing unit 11 cuts off the communica- 
tion. Finally, the flow of the procedure proceeds to a 

30 step SP6 at which the central processing unit 1 1 ends 
the processing procedure. 

[0081] Fig. 12 is a diagram showing the format of 
transferred data. The electronic pet apparatus 1 
exchanges data with a communication partner by way of 

35 an interface included in the network connection unit 1 7 
and an interface in the communication partner in 
accordance with the fomriat. As shown in the figure, 
each piece of data DT has a header for describing infor- 
mation such as the address and the type of the data DT 

40 Typically, the data DT includes pattern data 16E, recog- 
nition data 16A, voice data 16H. picture data 161 and so 
on, which are arranged sequentially, as necessary. 

1-3. Cataloging Recognition Data 

45 

[0082] Fig. 1 3 is a functional block diagram showing 
the electronic pet apparatus 1 in more detail in an oper- 
ation to catalog recognition data 1 6A. In this functional 
block diagram, a cataloging module 111 catalogs a 

50 result of voice recognition as recognition data 16A. In 
this way, it is possible to teach the electronic pet a vari- 
ety of words orally without entering the words via an 
input unit such as a keyboard. 
[0083] In order to accomplish the purpose 

55 described above, the voice recognition module 11A 
processes voice data DA by adoption of the HMM 
method, outputting a series of phonemes as a result of 
voice recognition. To put it in detail, a voice expressed in 



10 



19 



EP1 072 297 A1 



20 



the Japanese language is analyzed to identify its pho- 
nemes which are each indicated by an identifier Thus, 
a pronunciation in the Japanese language can be 
expressed by an array of identifiers. The identifiers are 
listed as follows: 'b', 'd'. 'g', 'p', 't. 'k\ 'm', 'n'. V, 'z', 'ch', 
Is', y, V, 'h\ 'r, 'e', 'a', 'o\ 'u'. 'N'. 'ei', 'ou', 's', 'sh'. 'xy', 
•j', T, and 'sir. The phoneme 'sil' is soundless. 
[0084] When the user says: "milon" ("oranges", in 
English) as an input, for example, the voice recognition 
module 11 A recognizes the voice input as a series of 
phonemes which are expressed by identifiers 'sil m i k a 
N sir. The voice recognition module 11A sequentially 
processes the voice data DA supplied thereto also 
sequentially to identify its phonemes. Results of recog- 
nition are then processed according to syntax shown in 
Fig. 1 4 to detect a series of phonemes represented by a 
series of identifiers. It should be noted that the syntax 
shown in Fig. 1 4 is syntax indicating pennitted connec- 
tions of all the phonemes listed above. 
[0085] In a nonTial operating mode, the video rec- 
ognition module 1 1 A searches the recognition data 16A 
for text data including a word or a phrase obtained as a 
search result con^esponding to an array of identifiers 
detected in this way, outputting the text data as a result 
of recognition. Thus, when a word not cataloged in the 
recognition data 16A is received from the user as a 
voice input in this embodiment, it will be difficult to gen- 
erate text data and it is hence hard to give a correct 
response to a voice input given by the user. 
[0086] In order to solve this problem, the electronic 
pet apparatus 1 implemented by this embodiment is 
connected to the network 1 8 by the network connection 
unit 17, being capable of downloading recognition data 
16A from the network 18. In this way, the downloaded 
recognition data 16A is taught to the electronic pet so 
that the electronic pet is capable of giving responses to 
a variety of sayings. 

[0087] In addition, in this embodiment, the central 
processing unit 11 executes a processing procedure 
shown in Fig. 15 when a catalog mode is selected by 
the user. During the execution of processing procedure, 
the user is requested to operate the confirm operator 4A 
and the cancel operator 4B as described below. The 
procedure is executed to catalog a word said by the 
user into the recognition data 1 6 A. 
[0088] As shown in Fig. 15, the procedure begins 
with a step SP1 1 . When a predetermined operator is 
operated, the flow of the procedure goes on to a step 
SP12 to enter a catalog mode in which the central 
processing unit 11 executes the picture synthesis mod- 
ule 1 1 G to display a predetermined message on the liq- 
uid-crystal display panel 3. The message requests the 
user to pronounce a word. 

[0089] Then, the flow of the procedure proceeds to 
a step SP14 at which the central processing unit 1 1 car- 
ries out voice recognition on the voice data DA received 

sequentially, identifying the data DA sequentially as a 
series of phonemes. As the user operates a predeter- 



mined operator to end the voice input, the flow of the 
procedure goes on to a step SP15. 
[0090] At the step SP1 5, the central processing unit 
1 1 executes the voice synthesis module 11 F In accord- 

5 ance with the series of phonemes obtained as a result 
of voice recognition to reproduce the voice received 
from the user. In this way. the result of voice recognition 
can be presented to the user. Assume that the user 
says the word "mikan". In this case, the central process- 

10 ing unit 1 1 produces a phoneme array of 'sil m i k a N sil' 
as a result of voice recognition and the voice synthesis 
module 1 1 F generates a sound saying: "Is it a mikan?" 
The flow of the procedure then goes on to a step SP16 
at which the central processing unit 1 1 accepts a signal 

15 entered by the user by operating the confirm operator 
4A or the cancel operator 48 in response to the gener- 
ated query sound. 

[0091 ] The flow of the procedure then goes on to a 
step SP1 7 at which the central processing unit 1 1 forms 

20 a judgment as to whether the confirm operator 4A or the 
cancel operator 4B has been operated by the user. If the 
cancel operator 4B has been operated by the user, the 
central processing unit 1 1 determines that the result of 
voice recognition presented to the user has been 

25 denied. In this case, the flow of the procedure goes back 
to the step SRI 3 to again accept a voice input. If the 
confirm operator 4A has been operated by the user, on 
the other hand, the central processing unit 1 1 deter- 
mines that the result of voice recognition presented to 

30 the user has been accepted. In this case, the flow of the 
procedure goes on to a step SP1 8. 
[0092] At the step SP1 8, the central processing unit 
1 1 again executes the picture synthesis module 11G to 
display a predetermined message on the liquid-crystal 

35 display panel 3. The message requests the user to say 
an attribute for the word said earlier as a voice input. An 
attribute is a keyword showing the property of an object 
identified by a word. An attribute is used for classifying 
an object. In the case of the word "mikan", for example, 

40 an attribute "fruit" is said by the user to determine the 
category of the word "mikan". 
[0093] The flow of the procedure then goes on to a 
step SP19 at which the central processing unit 1 1 car- 
ries out voice recognition on the voice data DA received 

45 sequentially, identifying the data DA sequentially as a 
series of phonemes. As the user operates a predeter- 
mined operator to end the voice input, the flow of the 
procedure goes on to a step SP20. 
[0094] At the step SP20, the central processing unit 

50 1 1 executes the voice synthesis module 1 1 F in accord- 
ance with the series of phonemes obtained as a result 
of voice recognition to reproduce the voice received 
from the user. In this way, the result of voice recognition 
carried out on the attribute can be presented to the user. 

55 Assume that the user says the word Iruit" as an 
attribute after saying the word "mikan". In this case, the 
voice synthesis module 1 1 F generates a sound saying: 
"Is it a fruit?" The flow of the procedure then goes on to 
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a step SP21 at which the central processing unit 1 1 
accepts a signal entered by the user by operating the 
confirm operator 4A or the cancel operator 4B in 
response to the generated query sound. 
[0095] The flow of the procedure then goes on to a 
step SP22 at which the central processing unit 1 1 forms 
a judgment as to whether the confirm operator 4A or the 
cancel operator 4B has been operated by the user. If the 
cancel operator 4B has been operated by the user, the 
central processing unit 1 1 determines that the result of 
voice recognition presented to the user has been 
denied. In this case, the flow of the procedure goes bacl< 
to the step SP18 to again accept a voice input. If the 
confirm operator 4A has been operated by the user, on 
the other hand, the central processing unit 11 deter- 
mines that the result of voice recognition presented to 
the user has been accepted. In this case, the flow of the 
procedure goes on to a step SP23. 
[0096] At the step SP23, the central processing unit 
1 1 catalogs the word 'mikan' into the recognition data 
16A and the attribute 'fruit' into the l<nowledge base 
16G. The flow of the procedure then proceeds to a step 
SP24 to end the whole processing. 
[0097] The knowledge base 16G is recorded 
attributes such as the word fruit and the word drinl< 
showing classification of words and phrases cataloged 
in the recognition data 16A. Pattern data 16E is also 
recorded attributes which make the central processing 
unit 1 1 capable of asking the user for example a ques- 
tion: "What food do you like?" In response to this ques- 
tion, let the user answer: "I like mikan (oranges)." Then, 
in response to the answer given by the user, the central 
processing unit 1 1 for example makes a comment: "I 
don't like mikan (oranges)." 

[0098] In addition to attributes, the knowledge base 
16G also includes the name and favorites of the keeper 
or the owner of the electronic pet apparatus 1 as well as 
various kinds of data such as a weather forecast 
received from the network 18. If necessary, this data 
can be utilized in a conversation with the user. When the 
user asks a question: "What is today's weather fore- 
cast?", for example, the electronic pet apparatus 1 is 
capable of giving an answer: "A clear weather" in con- 
formity with a predetermined rule using the words 
'today' and 'weather' as key phrases. 
[0099] In an operation to catalog a voice input into 
the recognition data 16A in the electronic pet apparatus 
1 as described above, a con'ect text for the voice input 
has to be verified not to already exist in the recognition 
data. In the above example, the correct text is a text 
describing the word "mikan." Text data obtained as a 
result of voice recognition is an array of alphabetical 
marks or an array of identifiers representing a series of 
phonemes representing a word or a phrase entered by 
the user as a voice input. In the above example, the 
array of alphabetical marks is 'sil m i k a N sir describing 
a word or a phrase to be cataloged into the recognition 
data 1 6A. If necessary, a text downloaded from the net- 



work 1 8 can also be cataloged into the recognition data 
1 6A. With such a text cataloged in the recognition data 
1 6A, a response may be generated from a recorded text 
in place of identifiers con-esponding to a series of pho- 

5 nemes obtained as a result of voice recognition. 

[0100] In the electronic pet apparatus 1 , recognition 
data 16A of a word or a phrase cataloged as a result of 
recognition of a voice input is processed in the same 
way as recognition data 16A of a word or a phrase 

10 downloaded from the network 18 and recognition data 
16A of a word or a phrase cataloged in advance, allow- 
ing a conversation to be held with the user. 

1 -4. User Authentication 

15 

[0101] Fig. 16 is afunctional block diagram showing 
the electronic pet apparatus 1 in an operation to authen- 
ticate the user in more detail. In this functional block dia- 
gram, authentication data 16K includes a user name 
20 recorded in advance. It should be noted that the user 
name is recorded as a result of voice recognition. 
Instead of obtaining the user name as a result of voice 
recognition, the user name can be entered via the key- 
board of an external apparatus in initial setting process- 
es ing which is typically carried out when the electronic pet 
apparatus 1 is purchased. 

[0102] The response-sentence creation module 
11E returns for example an answer saying: "Are you 
really the master?" in response to a key phrase saying: 
30 "Gao" in accordance with Rule 1 of the pattern data 1 6E 

shown in Rg, 17. 

[0103] In accordance with Rule 2, a voice authenti- 
cation module 11J sets a Boolean value 'authenticated' 
at 'TRUE" (described as 'set authenticated (TRUE)' in 
35 Rule 2) if the following 2 conditions are satisfied: 

a key phrase '$USER' defined as a user name and 
cataloged in advance is entered as a voice input; 
and 

40 a response including a phrase saying: "Are you 
really the master?" is generated by the response- 
sentence creation module 1 1 E immediately before 
the voice input '$USER' as myLastUtter. 

45 [0104] It should be noted that the function 
set_authenticated (TRUE) cited above sets the 
Boolean-value 'authenticated' at TRUE. 
[0105] To put it in detail, the voice recognition mod- 
ule 1 1 J searches the authentication data 16K for a user 

50 name matching a result of recognition of the voice input. 
If such a name is found in the search, a person entering 
the voice input is authenticated as the user and an 
authenticated state 1 6J is set at an authenticated user 
state. If such a name is not found In the search, on the 

55 other hand, a person entering the voice input is not 
authenticated as the user and the authenticated state 
16J is set at an un authenticated user state. 
[0106] If the user is authenticated, the response- 
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sentence creation module 1 1 E generates a response 
saying: "Yes, you are the nnaster" in accordance witli 
Rule 2. 

[01 07] As described above, the electronic pet appa- 
ratus 1 is capable of authenticating a user on the basis 
of a voice input and giving the owner a response differ- 
ent from a response given to a user other than the 
owner as shown in Fig. 8. In general, the electronic pet 
typically displays a behavior special for the owner as an 
actual pet does. 

[0108] Also as described above, a user is authenti- 
cated by comparing a voice input with a word cataloged 
in advance. 

[0109] In addition to the name of the user who has 
been authenticated, the voice recognition module 1 1 J 
may also request the response -sentence creation mod- 
ule 11 E to output a response making an inquiry about 
the favorite or the hobby of the user to be recorded in 
the knowledge base 16G in a conversation with the user 
who has been authenticated as the owner as shown in 
Fig. 18. In the example shown in the figure, the 
response making an inquiry says: "What is your favorite 
food, master?" This question asks the favorite food of 
the voice generator who has been authenticated as the 
owner. 

[0110] In response to this query, the user says: 
"Peanuts" as shown in Fig. 1 8. The word peanuts is sub- 
jected to a voice recognition process in the voice recog- 
nition module 11J and processed in the same way as 
Rule 2 shown in Rg. 1 7 to judge by the user's voice 
input, a response to the inquiry about a favorite. The 
word "peanuts" is then cataloged in the authentication 
data 16K. 

[0111] During a conversation with a person entering 
a voice input, the response-sentence creation module 
1 1 E generates an inquiry about a favorite, a hobby or 
the like cataloged in advance in the authentication data 
1 6K as one shown in Fig. 1 9 when invoked by the timer 
11B. In the case of the favorite food cataloged in the 
authentication data 16K as shown in Fig. 18, for exam- 
ple, the response-sentence creation module 1 1 E gener- 
ates an inquiry: "Are you really the master? What is your 
favorite food?" as shown in Fig. 19. 
[0112] The voice recognition module 11J deter- 
mines whether or not a voice input given by the user in 
response to the inquiry about the favorite food is true by 
carrying out the same processing as the one according 
to Rule 2 explained earlier by refen^ing to Fig. 17. Since 
the user is the owner in this case, a voice-input 
response of "Peanuts" is obtained. From a result of 
voice recognition of this response, an authentication 
state is set at a Boolean value of "TRUE". In addition, 
the response-sentence creation module 1 1 E generates 
a response of "You are really my master!" 
[0113] In this way, the electronic pet apparatus 1 is 
capable of forming a judgment on a result of voice rec- 
ognition based on a result of voice recognition obtained 
in the past. To put it in detail, the electronic pet appara- 



tus 1 is capable of making an inquiry about a result of 
voice recognition obtained in the past in response to the 
user's input during a conversation with the user, and 
fomning a judgment on a result of voice recognition of 

5 another voice input given In response to the inquiry in 
order to authenticate the user. 
[0114] In addition, when the user does not give a 
voice input in response to an inquiry made by the 
response-sentence creation module 1 1 E as triggered 

10 by the timer 1 1 8 even after a predetemriined period of 
time has lapsed, the voice recognition module 11J 
assumes that the user has typically terminated opera- 
tions of the electronic pet apparatus 1, resetting the 
authentication state. 

15 

1-5. Processing to Classify Conversations 

[01 1 5] Fig. 20 is a functional block diagram showing 
the electronic pet apparatus 1 in processing to classify 
20 conversations in more detail. In this functional block dia- 
gram, a word/phrase classification module 1 1 M identi- 
fies a result of voice recognition to classify 
conversations entered as a voice input in confomnity 
with a predetermined classification rule 1 6M, outputting 
25 a classification code to the response-sentence creation 
module 1 1 E as a result of classification. 
[0116] For example, the word/phrase classification 
module 1 1M classifies voice inputs of general greetings 
such as "Good morning" and "Good day" Into a "greet- 
so ing" category. Voice inputs of inquiries such as "How are 
you?" and "What do you like?" are classified into an 
"inquiry" category. Voice inputs of impressions such as 
"I am fine" and "Bored" are classified into an "impres- 
sion" category. 

35 [0117] In an operation to create a response sen- 
tence according to the pattern data 1 6E, the response- 
sentence creation module 11E forms a response 
according to response-sentence categories recorded in 
the pattern data 16E and a category pattern classified 

40 by the word/phrase classification module 1 1 M. In addi- 
tion, a response is created also in accordance with past 
conversation records stored the conversation history 
16R 

[0118] The pattern data 16E includes rules to be 
45 followed to classify response sentences as shown in 
Fig. 21 . The rules have the same syntax of comparison 
as the rules shown in Fig. 8. It should be noted that the 
classification rules shown in Fig. 21 are set for classifi- 
cation to be carried out by the word/phrase classifica- 
50 tion module 1 1M. 

[0119] According to Rule 1 shown in Fig. 21, the 
phrases saying: "I love you, too" and "Wow, I am a male 
though" are classified into a "state" category, a phrase 
saying: "A strange person" is classified into the "impres- 
55 sion" category and a phrase saying: "Who are you?" is 
classified into the "query" category. According to Rule 2, 
a phrase saying: "Shut up" is classified into the "impres- 
sion" category, a phrase saying: "What?" is classified 
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into the "query" category, a phrase saying: "Howdy" is 
classified into the "greeting" category and a phrase say- 
ing: "I am surprised" is classified into the "impression" 
category, a phrase saying: "Hi" is classified into the 
"greeting" category and a phrase saying: "Did you call 5 
me?" Is classified Into the "query" category. 
[0120] In addition, the pattern data 16E also pre- 
scribes a sort of restriction that prohibits a conversation 
from comprising consecutive specific categories. To put 
it concretely, the restriction is set so that no inquiry shall 
be returned in response to an inquiry made by the user. 
Furthermore, after 2 consecutive "greetings" are 
exchanged between the electronic pet apparatus 1 and 
the user, the electronic pet apparatus 1 shall not again 
issue a "greeting" as stipulated in a restriction shown at 
the bottom of Rg. 21. 

[0121] A conversation history 1 6F shown in Fig. 22 
or 23 is made by the response-sentence creation mod- 
ule 11E to record a conversation between the electronic 
pet and the user. As shown in the figures, a history 
includes an action taking party generating a voice, the 
category of the voice and the contents of the voice. The 
history shown in Fig. 22 describes the user as a first- 
action taking party, the "greeting" category as a type of 
the voice generated by the first-action taking party and 
a phrase saying: "Good day" of the "greeting" category 
as contents of the voice generated by the first-action 
taking party. The user is followed by the electronic pet 
as a second-action taking party. The type of the voice 
generated by the second-action taking party is also the 
"greeting" category and the contents of the voice gener- 
ated by the first-action taking party are a phrase saying: 
"Hi". The electronic pet is followed by the user as a third- 
action taking party. The type of the voice generated by 
the third-action taking party is the "query" category and 
the contents of the voice generated by the third-action 
taking party are a phrase saying: "How are you doing?" 
The user is followed by the electronic pet as a fourth 
action-taking party. The type of the voice generated by 
the fourth -action taking party is the "state" category and 
the contents of the voice generated by the fourth-action 
taking party are a phrase saying: "I am fine". 
[0122] When the response-sentence creation mod- 
ule 11 E creates a response in accordance with the pat- 
tern data 1 6E and on the basis of the emotion data 1 6C, 
a conversation history 16F is used as a reference and 
restrictions prescribed in the pattern data 16E are 
abided with. For example, after 2 consecutive greetings 
are exchanged between the electronic pet apparatus 1 
and the user, the response-sentence creation module 
1 1 E shall not again issue a greeting by applying Rule 2 
right after the 2 consecutive ones as stipulated in the 
restriction shown in Fig. 21 even if the "joy" emotion 
parameter has a largest value among the emotion vari- 
ables. In addition, no "inquiry" shall be returned In 
response to an "inquiry" made by the user. 
[0123] By abiding with the restriction on greetings 
described above, even if a first rule stipulates that a 



greeting shall be returned In response to a greeting and 
a second rule stipulates that a greeting shall be 
returned in response to a variety of inquiries, It is possi- 
ble to avoid an unnatural conversation comprising greet- 
ings exchanged between the user and the electronic pet 
repeatedly a number of times due to repetitive applica- 
tion of the first and second rules described above. 

1-6. Emotion Control 

[01 24] Fig. 24 is a functional block diagram showing 
the electronic pet apparatus 1 in execution of emotion 
control in more detail. In this functional block diagram, 
an emotion changing module 11 D is activated by the 
timer 1 1 B described earlier to search the character data 
16D by using a word included in a result of voice recog- 
nition as a keyword for variances conresponding to the 
word, and updates the emotion data 1 6C by using the 
variances found in the search. 
[0125] In this processing, the emotion changing 
module 11D records changes in variables composing 
the emotion data 16C, text data obtained as a result of 
voice recognition of the user's input and keywords each 
included in the text data and used for searching the 
character data 16D for the changes as an emotion- 
change history 16N like one shown in Fig. 25. In addi- 
tion, with predetermined timing typically after a 
response has been output, the emotion-change history 
16N is searched for a word used frequently In user 
Inputs in conjunction with a keyword. If such a word is 
found, the word is cataloged in the character data 16D 
as a new keyword as shown in Fig. 26. The character 
data 1 6D shown in Fig. 26 is obtained by cataloging a 
new keyword in the character data 16D shown in Fig. 6. 
By cataloging this word in the character data 16D as a 
new keyword, the variables of the emotion data 1 6C can 
be updated even when this word alone Is input in the 
same way as the other keywords. 
[0126] For example, assume the phrase "curry 
bread" is used in user inputs as shown in Rg. 25 in con- 
junction with the keyword "dirty" which changes the var- 
iables of the emotion data 16C a number of times 
exceeding a predetermined value. In this case, the emo- 
tion changing module 11D catalogs the phrase "curry 
bread" in the character data 16D as a new keyword as 
shown in Fig. 26. As shown in Fig. 26, the variables of 
the emotion data 16C are updated by using the same 
changes as the keyword "dirty" even when this phrase 
"curry bread" only is Input. 

[0127] As a result, the electronic pet apparatus 1 
sets a variety of parameters and variables so that a spe- 
cific emotion is resulted in by the so-called associative 
information and is hence capable of generating a 
response based on the resulting emotion. 
[0128] In addition, when the emotion changing 
module 11D searches the emotion-change history 16N 
with the predetermined timing as described above, the 
frequency of using each keyword for changing the vari- 
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ables of the emotion data 1 6C is also found out as well. 
If the frequency of using a keyword is found greater than 
a predetermined value, as shown in Fig. 27, the emotion 
changing module 11D reduces the absolute values of 
the variations in the character data 1 6D for the keyword 5 
from the original values shown in Fig. 6. In the emotion- 
change history 1 6N shown in Fig. 27, for example, the 
keyword "dirty" is used frequently. Thus, the variations 
in 6 variables, namely, the "anger", "sadness", "joy", 
"fear", "surprise" and "hatred" emotion parameters in 
the character data 16D for the keyword "dirty" are 
changed by -1,-1, +2, +1, -1 and -4 respectively. 
[0129] In this way, the electronic pet apparatus 1 is 
capable of forming the so-called sense of accustoming. 
[0130] If the frequency of using a keyword of the 
character data 1 6D in voice inputs gets lower, on the 
other hand, the emotion changing module 11D elimi- 
nates the keyword used in voice inputs from the charac- 
ter data 16D. As described above, if the frequency of 
using a keyword is found greater than a predetermined 
value, the emotion changing module 11D reduces the 
absolute values of the variations in the character data 
16D for the keyword. If the frequency of using the key- 
word decreases again, however, the variations are grad- 
ually restored to their original values. 
[0131] In this way, the electronic pet apparatus 1 is 
capable of creating the so-called state of forgetting 
something. 

1 -7. Operation of the First Embodiment 

[0132] In the configuration described above, the 
voice recognition module 11A employed in the elec- 
tronic pet apparatus 1 shown in Figs. 1 to 3 carries out 
a voice recognition process using the HMM method on 
a voice input entered by the user via the microphone 7. 
As described above, the voice recognition module 1 1 A 
is a functional block, the processing of which is carried 
out by the central processing unit 1 1 . In the voice recog- 
nition processing, a voice is first converted into a series 
of phonemes which are then transformed into text data 
by referring to the recognition data 16A. 
[0133] In the electronic pet apparatus 1 , text data 
obtained as a result of voice recognition carried out in 
this way is supplied to the physical-condition changing 
module 11 C which changes the 5 elements of the 
present physical condition, namely, the "fatigue", "hun- 
ger", "thirstiness", "sickness" and "sleepiness" parame- 
ters of the physical-condition data 1 6B shown in Fig. 4, 
in accordance with a word included in a voice input. 
When food has been given as indicated by a result of a 
voice recognition, for example, the "hunger" parameter 
is decreased and, when a drink is received as indicated 
by a result of a voice recognition, for example, the 
"thirstiness" parameter is decreased. 
[0134] In this way, the electronic pet apparatus 1 is 
capable of changing the physical condition by a voice 
input entered by the user. In addition, the 5 parameters 



can also be changed gradually by processing candied 
out by the physical-condition changing module 1 1C and 
based on the timer 11B. Thus, in the electronic pet 
apparatus 1. the physical condition expressed in temns 
of these parameters is modified by a voice input entered 
by the user and changes with the lapse of time. As a 
result, by generating a response based on the 5 param- 
eters to a voice input, the physical condition of the elec- 
tronic pet is reflected in the response to the voice input. 
[0135] In addition, the result of voice recognition is 
supplied also to the information changing module 11D 
which changes the emotion data 1 6C shown in Fig. 5 in 
accordance with a word included in a result of voice rec- 
ognition. Changes in emotion data 16C are described in 
character data 16D. The 6 variables expressing the 
emotion are updated in accordance with keywords and 
the character data 1 6D. To put it in detail, keywords are 
the words for changing emotions of the electronic pet, 
while, as shown in Fig. 6, the character data 1 6D com- 
prises variations in 6 variables expressing the emotion, 
namely, the "anger", "sadness", "joy", "fear", "surprise" 
and "hatred" parameters, for a variety of keywords, that 
is, words included in voice inputs. That is to say, the 
emotion is changed in accordance with a voice input 
entered by the user. 

[0136] In this way, the electronic pet apparatus 1 
changes the emotion of the electronic pet in accordance 
with a voice input given by the user. In addition, since 
the electronic pet apparatus 1 creates a response to a 
voice input in accordance with a result of recognition of 
the voice input on the basis of the physical-condition 
data 1 6B and the emotion data 16C, the response of the 
electronic pet reflects the physical condition and the 
emotion of the electronic pet. 

[0137] To put it in detail, in the electronic pet appa- 
ratus 1 , a result of voice recognition is supplied to the 
response-sentence creation module 1 1 E which creates 

a response sentence for the result of voice recognition 
in accordance with rules described in the pattern data 
16E as shown in Fig. 8. To put it in detail, in the elec- 
tronic pet apparatus 1 , the pattern data 1 6E describes a 
response sentence for each key phrase included in the 
voice input. The response-sentence creation module 
1 1 E searches the pattern data 1 6E for a response sen- 
tence associated with the key phrase obtained as a 
result of voice recognition, outputting the response sen- 
tence as a search result. 

[0138] In the electronic pet apparatus 1, a actual 
response corresponding to the response sentence is 
generated by the voice synthesis module 1 1 F and out- 
put to the speaker 6. Files each containing the voice for 
each response are shown in Fig. 9. On the other hand, 
a picture associated with the actual response is created 
by the picture synthesis module 1 1G to be displayed on 
the liquid-crystal display panel 3. Files each containing 
the picture for each response are shown in Fig. 10. In 
this way, a actual response to a voice input entered by 
the user is presented to the user as a voice and a pic- 
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ture. 

[0139] Since a response is created in the electronic 
pet apparatus 1 from the pattern data 16E comprising 
rules for generating different responses in accordance 
with the physical-condition data 16B and the emotion 5 
data 16C, the response to the user reflects the physical 
condition and the emotion of the electronic pet. 
[0140] In the processing sequence described 
above, the electronic pet apparatus 1 is capable of 
switching the operation to a cataloging operation mode 
in conformity with a predetemfiined operator carried out 
by the user In this mode, a word and a phrase that can 
be recognized in a voice recognition process are catal- 
oged into the recognition data 16A. 
[0141] To put it in detail, when a voice representing 
a word or the like to be cataloged is received as an input 
from the user in this cataloging mode, the electronic pet 
apparatus 1 carries out the same processing as that in 
the normal operating mode shown in Figs. 13 to 15 to 
convert the voice input into a series of phonemes in 
accordance with the syntax shown in Fig. 14. A voice 
represented by this series of phonemes is then gener- 
ated by the voice synthesis module 1 1 F to be confirmed 
by the user. After the user confirms that the result of 
voice recognition is correct, the user enters another 
voice input representing the attribute of the confirmed 
word or the like. 

[0142] The electronic pet apparatus 1 also converts 
the voice input representing the attribute Into a series of 
phonemes. If a voice generated from this series of pho- 
nemes is also confirmed by the user, the series of pho- 
nemes representing the word or the like entered earlier 
is cataloged into the recognition data 1 6A while the data 
of the attribute is cataloged into the knowledge base 
16G, being associated with the word or the like catal- 
oged in the recognition data 16A. 
[01 43] As described above, the electronic pet appa- 
ratus 1 is capable of cataloging words and the like 
entered as a voice input without carrying out difficult 
operations on an Input unit such as a keyboard, allowing 
the degree of freedom to use the apparatus 1 to be 
raised commensurately. In addition, it is possible to 
make the word vocabulary larger to nurture the elec- 
tronic pet as if the user were actually training a real pet. 
As a result, the electronic pet can be made familiar and 
easy to get acquainted with commensurately. 
[0144] As described above, in a normal voice rec- 
ognition process, the recognition data 16A is searched 
for text data corresponding to a series of phonemes 
obtained as a result of voice conversion and the text 
data is output as a result of voice recognition used in 
creation of a response sentence. The text data found in 
the search may be a word or the like cataloged in the 
cataloging mode described above. Such text data 
described by a series of phonemes can also be used in 
creation of a response sentence in place of text data 
usually found in the normal voice recognition process. 
Creation of a response sentence is .also based on an 



attribute recorded in the knowledge base 16G. Thus, 
when the physical-condition data 16B indicates that the 
electronic pet is hungry and the input received from the 
user has a food attribute, for example, the electronic pet 
apparatus 1 is capable of generating a response stating 
typically: "I want to eat" or "l want some food.' 
[0145] As described above, a word and the attribute 
of the word are received as separate voice inputs and, 
after the results of voice recognition of the voice inputs 
are confirmed by the user, the word and the attribute are 
cataloged. In this way, since a word and the attribute of 
the word are entered by the user separately as voice 
inputs and their results of voice recognition are con- 
firmed by the user, it is possible to catalog the word and 
the attribute with ease and a high degree of reliability. 
[0146] When the user enters a voice input saying: 
"Gao", on the other hand, the electronic pet apparatus 1 
carries out the processing represented by the functional 
block diagram shown in Fig. 1 6, using the input voice as 
a keyword for generating a voice based on Rule 1 
shown in Fig. 17 in order to request the user to enter 
infonnation cataloged in advance such as the name of 
the user. A voice input entered by the user in response 
to this request is subjected to a voice recognition proc- 
ess. The voice recognition module 1 1J employed in the 
electronic pet apparatus 1 compares a result of the 
voice recognition process with the recognition data 16K. 
If the outcome of the comparison authenticates the 
user, the authentication state 1 6J is set to indicate that 
the person entering the voice is the owner 
[0147] The response-sentence creation module 
11 E of the electronic pet apparatus 1 creates a 
response sentence based on a rule of the pattern data 
16E or Rule 1 of Fig. 8 which distinguishes a person 
other than the owner entering a voice input from the 
owner To be more specific, the response-sentence cre- 
ation module 11 E refers to the authentication state 16J 
and creates different responses depending on the value 
of the authentication state 16J. 
[01 48] Thus, the electronic pet apparatus 1 is capa- 
ble of responding by displaying a special behavior to the 
owner as a real pet does, allowing the electronic pet to 
be made easy to get acquainted with commensurately 
[0149] In addition, in the electronic pet apparatus 1 , 
the timer 1 1 B activates the voice authentication module 
11J to carry out processing of user authentication at 
predetermined intervals. In the user authentication 
processing which is carried out at predetermined inter- 
vals, the voice authentication module 1 1 J fomns a judg- 
ment as to whether or not the user is the owner As 
shown by a typical conversation of Fig. 1 9, the judgment 
is based on a voice input entered by the user in 
response to an inquiry about the favorite, the hobby or 
the like of the user which was recorded in the knowl- 
edge base 1 6G as shown by a typical conversation of 
Fig. 18. In this way, processing to authenticate the user 
can be carried out. 

[0150] Thus, the electronic pet apparatus 1 is capa- 
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ble of creating a response showing a special behavior to 
the owner by verifying the conversation partner to be the 
user in the course of conversation when necessary. 
[0151] In the course of a conversation, the 
word/phrase classification niiodule 1 1 M employed in the 
electronic pet apparatus 1 classifies a voice input into a 
"greeting" or "query" category or the like in processing 
shown in Fig. 20 in accordance with the word/phrase 
classification rule 16M by referring to a conversation 
history like one shown in Fig. 22 or 23 if necessary. In 
addition, a response to a voice input said by the user is 
created by following a category restriction described in 
the pattern data 16E shown in Fig. 21. If a voice input 
said by the user is classified into the "query" category, 
for example, the classification restriction does not allow 
a response to be generated even If a key phrase corre- 
sponding to the voice input in a rule stipulates that a 
query be generated as a response to the voice input. 
[0152] Thus, the electronic pet apparatus 1 is capa- 
ble of avoiding an unnatural conversation in which a 
query is returned in response to a query raised by the 
user. 

[0153] In addition, the electronic pet apparatus 1 
records categories of consecutive words or phrases in a 
continuous-conversation history 16F. A response to a 
voice input said by the user is created by referring to 
categories recorded in the continuous-conversation his- 
tory 16F and by considering a category restriction 
described in the pattern data 16E shown in Rg. 21. As 
a result, when the user enters a greeting following a 
greeting said by the electronic pet apparatus 1 , the elec- 
tronic pet apparatus 1 does not generate another greet- 
ing in response to the user's greeting in accordance 
with the category restriction even if a greeting-to-greet- 
ing rule stipulates that a greeting shall be generated in 
response to a greeting. 

[0154] Thus, the electronic pet apparatus 1 is capa- 
ble of avoiding an unnatural conversation in which 
greetings are exchanged a number of times forever, 
allowing the electronic pet to be made a familiar thing. 
[0155] In addition, the electronic pet apparatus 1 
also carries out processing shown In the functional 
block diagram of Fig. 24 to record changes in variables 
composing the emotion data 1 6C, text data obtained as 
a result of voice recognition of the user's input and key- 
words each included in the text data in the emotion- 
change history 16N like the one shown in Fig. 25. In the 
electronic pet apparatus 1 , the emotion-change history 
16N is searched for a word used frequently in user 
inputs in conjunction with a keyword at predetermined 
intervals. If such a word is found, the word is cataloged 
in the character data 16D as a new keyword used for 
changing the emotion data 1 6C as shown in Fig. 26. 
[0156] For example, assume that, in the electronic 
pet apparatus 1, the phrase "curry bread" is used in 
user inputs as shown in Fig. 25 in conjunction with the 
keyword "dirty" which changes the variables of the emo- 
tion data 1 6C a number of times exceeding a predeter- 



mined value. In this case, the emotion changing module 
11 D catalogs the phrase "curry bread" in the character 
data 16D as a new keyword as shown in Fig. 26. As 
shown in Fig. 26, the variables of the emotion data 16C 

5 are updated and a response is generated even when 
this phrase "curry word" alone is input by using the 
same changes as the keyword "dirty". 
[01 57] Thus, the electronic pet apparatus 1 is capa- 
ble of changing the emotion of the electronic pet by a 

10 variety of variations as an animal reacts in dependence 
on conditions and as a human being changes the emo- 
tion thereof as a result of an association process. In 
addition, the electronic pet apparatus 1 is capable of 
reflecting the variations in emotion in a response gener- 

15 ated thereby. 

[0158] In addition, when the emotion changing 
module 1 1 D employed in the electronic pet apparatus 1 
searches the emotion-change history 16N, the fre- 
quency of using each keyword for changing the varia- 

20 bles of the emotion data 16C is also checked out as 
well. If the frequency of using a keyword is found greater 
than a predetermined value, as shown in Fig. 27. the 
emotion changing module 11D reduces the absolute 
values of the variations in the character data 1 6D for the 

25 keyword. In this way, the electronic pet apparatus 1 is 
capable of forming the so-called sense of accustoming 
and the state of accustoming is reflected to the 
response. 

[0159] in the electronic pet apparatus 1 used in this 
30 way, the user Is allowed to operate the operators on the 

front panel shown in Fig. 2 to connect the apparatus 1 to 
the network 18 through the network connection unit 17 
shown in Fig. 1. With the network connection unit 17 
connected to the network 18, the electronic pet appara- 

35 tus 1 is capable of downloading information such as rec- 
ognition data 16A, knowledge base data 16G and 
pattern data 16E from the network 18, As described 
eariier, the downloaded information is effective rules 
necessary for the voice recognition processing and the 

40 response generation processing. The downloaded infor- 
mation is also used to update the recognition data 1 6A 
and the knowledge base 16G, allowing the user to enjoy 
conversations with the electronic pet at a higher level. In 
addition, it is also possible to download voice data 1 6H 

45 and picture data 161 which can be used as actual 
response outputs. In this way, expressions of responses 
can also be improved as well. 
[0160] By the same token, it is also possible to 
transmit the physical-condition data 1 6B, the emotion 

50 data 16C and the a conversation history 16F to a 
desired apparatus by way of the network 1 8. In this way, 
the recipient apparatus Is capable of reproducing the 
electronic pet of the electronic pet apparatus 1 , allowing 
the electronic pet to be taken out to a variety of environ- 

55 ments. 

[01 61] On the contrary, it is also possible to receive 
physical-condition data 168, emotion data 16C and a 
conversation history 1 6F from the network 1 8, allowing 
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the electronic pet apparatus 1 to generate a response 
as if another electronic pet were visiting the electronic 
pet apparatus 1 or as if an electronic pet raised in 
another electronic pet apparatus had been brought out 
to this electronic pet apparatus 1 . 5 

1-8. Effects of the First Enfibodiment 

[0162] According to the configuration described 
above, the recognition data used as rules of voice rec- 
ognition, the pattern data used as rules of response 
generation, the emotion data used as rule of emotion 
generation, the physical-condition data, the voice data 
and the picture data can be updated through the net- 
work, making it possible to generate almost the same 
responses as responses generated by the another 
apparatus of the same type connected to the network as 
if an electronic pet had been brought out from the other 
apparatus to this electronic pet apparatus. In addition, 
the electronic pet apparatus also makes the electronic 
pet easy to get acquainted with as a real pet in actual 
training is. Moreover, the amount of knowledge can also 
be increased by typically increasing the number of 
words that can be understood by the electronic pet if 
necessary. 

[0163] Furthermore, the recognition data can be 
updated by periodical connection to the network. Thus, 
the amount of knowledge can be increased without the 
need for the user to take the trouble to do it. 
[0164] On the contrary, the physical-condition data, 
the emotion data and a conversation history can be 
transmitted to another apparatus of the same type by 
way of the network. In this way, the other apparatus is 
capable of generating almost the same responses as 
responses to voice Inputs entered to this information 
processing apparatus such as the electronic pet appa- 
ratus, allowing the electronic pet to be treated as if the 
electronic pet had been taken out to the other appara- 
tus. As a result, the electronic pet apparatus is capable 
of making the electronic pet easy to get acquainted with 
as a real pet in actual training is. 
[0165] In addition, data can be updated and trans- 
mitted by using an IC card which is replaceable record- 
ing media. To be more specific, a new IC card is 
mounted to update data and an IC card is taken to 
another apparatus to transfer data to the other appara- 
tus. Thus, data can also be exchanged with various 
kinds of equipment with no communication function. 
[0166] Moreover, in a cataloging operation mode, a 
result of voice recognition of a word and the category of 
the word are cataloged, allowing size of the vocabulary 
of words which can be understood by the electronic pet 
to be increased with ease by voice inputs. As a result, 
the electronic pet can be treated in the same way as a 
real pet is raised in actual training and can be made 
easy to get acquainted with. 

[0167] Furthermore, at that time, on the basis of 
series of phonemes obtained as a result of voice recog- 



nition, the result of voice recognition of the word and the 
category of the word are cataloged. Thus, a word and its 
category can be cataloged by merely entering a voice 
input without carrying out other operations. 
[0168] On the top of that, a result of voice recogni- 
tion is output as text data in nomnal processing and, in a 
cataloging operation, a description of a series of pho- 
nemes is recorded. As a result, description of data such 
as rules can be simplified. 

[0169] In addition, a word and an attribute are 
treated as inputs distinguished from each other in the 
cataloging operation. As a result, the cataloging proc- 
ess can be executed with ease. 
[0170] Furthermore, a result of user authentication 
based on a voice input is used as a basis for generating 
different responses for different persons entering voice 
inputs. Thus, a response of the electronic pet for the 
owner can be made different from a response for a per- 
son other than the owner. As a result, the electronic pet 
is capable of displaying a behavior as a real pet does 
and becomes more familiar as well as easier to get 
acquainted with. 

[0171] Moreover, by using results of voice recogni- 
tion obtained in the past, a result of voice recognition 
obtained this time is examined to authenticate the user. 
In this way, the user can be authenticated by a conver- 
sation without entering a password. As a result, the 
degree of freedom to use the electronic pet apparatus 
can be raised. 

[0172] On the top of that, by using results of voice 

recognition obtained in the past, the user's response to 
an inquiry obtained this time is examined to authenti- 
cate the user or the user is authenticated by user's say- 
ing of a predetermined word. In this way, the user can 
be authenticated through a natural conversation. As a 
result, the degree of freedom to use the electronic pet 
apparatus can be raised commensurately. 
[0173] In addition, by identifying the type of a voice 
input and by generating a response other than a 
response of a predetermined type or generating a 
response of a category con-esponding to the identified 
type of the voice input, it is possible to avoid an unnatu- 
ral conversation like one in which an inquiry is made in 
response to an inquiry. In this way, a response given by 
the electronic pet can be made natural as well as lively. 
As a result, the electronic pet can be made more famil- 
iar and easier to get acquainted with. 
[0174] Furthermore, at that time, generation of a 
response by referring to a history including the types of 
input and responses can avoid an unnatural conversa- 
tion like one in which greetings are exchanged repeat- 
edly a number of times. In this way, a response given by 
the electronic pet can be made natural as well as lively. 
As a result, the electronic pet can be made more famil- 
iar and easier to get acquainted with. 
[0175] Moreover, variations in emotion parameters 
can be changed in accordance with a history of result of 
the voice recognition and con-esponding emotion 
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parameters. For a voice heard frequently, for example, it 
is possible to generate a response full of emotions of 
intimacy, familiarity and the like. In this way, a response 
given by the electronic pet can be made natural as well 
as lively. As a result, the electronic pet can be made 
more familiar and easier to get acquainted with. 
[0176] To be more specific, if a word other than a 
specific word exciting an emotion is used at the same 
times as the specific word or as frequently as the spe- 
cific word is, this frequently used word also changes the 
emotion parameters. That is to say, it is possible to gen- 
erate a response based on an emotion changed by a 
repeatedly used word in combination with the specific 
word. 

[0177] On the top of that, if a specific word among 
words exciting an emotion is used frequently, variations 
in emotion parameters are decreased. As a result, the 
so-called sense of accustoming can be formed. 

2. Effects of Other Embodiments 

[0178] In the embodiment described above, the 
electronic pet apparatus can be connected to a networl< 
to take out the electronic pet from the electronic pet 
apparatus, to generate a response of an electronic pet 
raised in another apparatus and to teach the electronic 
pet nurtured in this electronic apparatus a variety of 
rules and various kinds of information. It should be 
noted, however, that the scope of the present invention 
Is not limited to this embodiment. For example, only 
some of the processing described above can be made 
to be carried out when necessary. In addition, the elec- 
tronic pet apparatus makes an access to the network 
periodically when the user carries out a predetermined 
operation or when a call is received from another appa- 
ratus. 

[0179] Moreover, according to the embodiment 
described above, the electronic pet apparatus is con- 
nected to a network by a telephone line. It is worth not- 
ing, however, that the invention can also be applied to 
applications wherein the electronic pet apparatus is 
connected to a network through other equipment such 
as a modem or a personal computer 
[0180] Furthemnore, in the embodiment described 
above, the so-called electronic pet learns recognition 
data, pattern data, voice data and picture data down- 
loaded from a network. It should be noted, however, that 
the scope of the present invention is not limited to this 
embodiment For example, the electronic pets may also 
learn only some of the downloaded data as necessary. 
In addition, the technique itself to recognize a voice, the 
technique to generate voice data and the technique to 
generate picture data themselves can be modified by 
downloaded control programs describing the tech- 
niques. By the same token, the technique to generate 
emotion data and the processing of the response-sen- 
tence creation module and other processing can also be 
changed. 



[0181] On the top of that, according to the embodi- 
ment described above, physical-condition data, emotion 
data and a conversation history can be transmitted to 
another apparatus in order to take out the electronic pet 

5 thereto. It is worth noting, however, that the scope of the 
present invention is not limited to such an embodiment. 
For example, when only some of the data is transmitted 
or the data is transmitted along with information such as 
knowledge, other apparatus may can7 out processing 

10 to emulate the electronic pet of this electronic pet appa- 
ratus. In addition, instead of transmitting such data, a 
response to an input obtained as a result of voice recog- 
nition carried out by another apparatus can be transmit- 
ted to the other apparatus. 

15 [0182] Furthermore, according to the embodiment 
described above, various kinds of data can be input 
from another apparatus in order to bring out the elec- 
tronic pet of the other apparatus to this electronic pet 
apparatus. It should be noted, however, that the scope 

20 of the present invention is not limited to such an embod- 
iment. For example, when only some of the data is 
received or the data received along with information 
such as knowledge, processing to emulate the elec- 
tronic pet of the other apparatus can be can-led out. In 

25 addition, instead of internally processing such data 
received from the other equipment, this electronic pet 
apparatus may transmit a result of voice recognition to 
the other apparatus and then receives a response to the 
result of voice generation from the other apparatus. 

30 [0183] Moreover, in the embodiment described 
above, a voice input is subjected to a voice recognition 
process to convert the input into a series of phonemes. 
It is worth noting, however, that the scope of the present 
invention is not limited to such an embodiment. For 

35 example, a variety of voice recognition techniques 
proper for processing requirements can also be 
adopted. 

[0184] On the top of that, in the embodiment 
described above, a word and the attribute of the word 

40 are each entered as a voice input to be cataloged in the 
electronic pet apparatus. It should be noted, however, 
that the scope of the present invention is not limited to 
such an embodiment. For example, an attribute can be 
selected and entered to the electronic pet apparatus by 

45 the user by operating an operator. In this case, there is 
a conceivable technique whereby the user is requested 
to enter an attribute by selecting an item on a displayed 
menu. 

[0185] Furthemiore, according to the embodiment 

50 described above, for a voice input to be cataloged in an 
authentication data as text data of a series of phonemes 
representing the voice input, a result of voice recogni- 
tion is output as a series of phonemes. As for an ordi- 
nary result of voice recognition, ordinary text data is 

55 merely produced. It is worth noting, however, that the 
scope of the present invention is not limited to such an 
embodiment. For example, also for an ordinary result of 
voice recognition, the result of voice recognition can be 
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output as a series of phonemes. 
[0186] Moreover, in the embodiment described 
above, the user is authenticated by identifying the name 
or the favorite of the user. It should be noted, however, 
that the scope of the present invention is not limited to 
such an embodiment. For example, the present inven- 
tion can also be applied to a wide range of applications 
wherein the user is authenticated by verifying a pass- 
word, a date of a past conversation or a past result of 
voice recognition. 

[0187] On the top of that, in the embodiment 

described above, the user is authenticated by checl<ing 
a special phrase said by the user in response to a pre- 
determined query made by the electronic pet apparatus 
and the user is authenticated periodically. It is worth not- 
ing, however, that the scope of the present invention is 
not limited to such an embodiment. For example, the 
user can also be authenticated either by verification of a 
special phrase or periodically as necessary. 
[0188] Furthermore, according to the embodiment 
described above, in a process to recognize a voice input 
by splitting the input Into a series of phonemes, the user 
is authenticated by verifying a generated voice repre- 
senting a special word. It should be noted, however, that 
the scope of the present invention is not limited to such 
an embodiment. For example, the user can also be 
authenticated by verifying a variety of characteristic 
quantities representing the characteristics of the user's 
voice to give the same effect as the embodiment 
described above. Examples of the characteristics quan- 
tities are the tone and the frequency spectrum of the 
voice. 

[0189] Moreover, in the embodiment described 
above, a response of the electronic pet for the owner 
can be made different from a response for a person 
other than the owner. It is worth noting, however, that 
the scope of the present invention is not limited to such 
an embodiment. For example, more different responses 
can be generated for more different persons providing 
voice inputs such as members of the family of the owner 
and persons other than family members. 
[0190] On the top of that, in the embodiment 
described above, an inquiry is prevented from being 
issued in response to an inquiry In a simple manner 
based on the type of the inquiry input and the type of the 
inquiry response. It should be noted, however, that the 
scope of the present invention is not limited to such an 
embodiment. For example, an inquiry may be issued in 
response to an inquiry due to reasons such as the emo- 
tion. In this case, it shows that the electronic pet is in the 
bad mood. 

[0191] Furthermore, according to the embodiment 
described above, the emotion is controlled by manipula- 
tion of character data. It is worth noting, however, that 
the scope of the present invention is not limited to such 
an embodiment. For example, the emotion data can 
also be changed directly Instead of manipulating the 
character data. 



[0192] Moreover, the embodiment described above 

outputs voice data and picture data. It should be noted, 
however, that the scope of the present invention is not 
limited to such an embodiment. For example, voices 
5 and pictures are output as a result of audio and video 
syntheses. 

[0193] On the top of that, in the embodiment 
described above, the voice recognition processing and 
the picture synthesis processing are carried out by the 

10 central processing unit as shown In Fig. 3. It is worth 
noting, however, that the scope of the present invention 
is not limited to such an embodiment. For example, the 
voice recognition processing and the picture synthesis 
processing can also be carried out by dedicated circuits 

15 as shown in Fig. 28. 

[0194] Furthermore, the embodiment described 
above applies the present invention to an electronic pet 
apparatus outputting a voice and a picture as a 
response. It should be noted, however, that the scope of 

20 the present invention is not limited to such an embodi- 
ment. For example, the present invention can also be 
applied for example to a robot moving tike an animal, an 
electronic pet apparatus moving and crying to output a 
response and an electronic pet apparatus outputting 

25 responses in a variety of forms. 

[0195] Moreover, the embodiment described above 
applies the present invention to an electronic pet appa- 
ratus which is a special-purpose apparatus for emulat- 
ing an electronic pet with the front panel thereof shown 

30 in Fig. 2. It is worth noting, however, that the scope of 
the present invention is not limited to such an embodi- 
ment. For example, the present invention can also be 
applied to a variety of portable devices such as a porta- 
ble telephone, a portable GPS, a portable tape recorder 

35 and a portable optical-disc drive with a front panel 
thereof shown in Fig. 28. In addition to such portable 
devices, the present invention can also be applied to 
information processing apparatuses such as a personal 
computer in which a variety of animation characters or 

40 the like move. 

Industrial Applicability 

[0198] The present invention can be utilized for an 
45 entertainment robot. 

Reference Numerals 

[0197] 

50 

1 ... Electronic pet apparatus; 11 A ... Voice recogni- 
tion module; 11B ... Timer; 11C ... Physical-condi- 
tion changing module; 11D ... Emotion changing 
module; 11 E ... Response-sentence creation mod- 
55 ule; 1 1 F ... Voice synthesis module; 1 1G ... Picture 
synthesis module; 1 1 1 ... Cataloging module; 1 1 J ... 
Voice recognition module; 11M ... Word/phrase 
classification module; 16A ... Recognition data; 16B 
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... Physical-condition data; 16C ... Emotion data; 
16D ... Character data; 16E ... Pattern data; 16F ... 
Conversation history; 16G ... Knowledge base; 16H 
... Voice data; 161 ... Picture data; 16J ... Authentica- 
tion state; 16K ... Authentication data; 16M ... Clas- 
sification rule; 16N ... Ennotlon-change history; 17 
... Network connection unit. 

Claims 

1 . An infomnation processing apparatus characterized 
in that said apparatus comprises; 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition in 
conformity with a predetermined recognition 
rule; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 
and varies with the lapse of time, in conformity 
with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 
response to a result of voice recognition in con- 
formity with a predetemnined response genera- 
tion rule based on at least said emotion 

parameter; 

response output means for outputting said 
response; and 

communication means for can7ing out 
processing to update said recognition rule, said 
emotion-parameter generation rule and said 
response generation rule by connection to a 
predetermined network; or 
. communication means for canning out 
processing to update data necessary for said 
recognition rule, said emotion-parameter gen- 
eration rule and said response generation rule 
by connection to said predetermined network. 

2. The information processing apparatus according to 
claim 1 , said apparatus characterized in that said 
communication means periodically connects said 
information processing apparatus to said network in 
order to carry out said update processing. 

3. The information processing apparatus according to 
claim 1, said apparatus characterized in that at 
least said emotion parameter or data required in 
generation of said emotion parameter can be 
updated by using data stored In replaceable record- 
ing media. 



4. An infomnation processing apparatus characterized 
in that said apparatus comprises: 

voice input means for inputting a voice output 

5 by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition in 
confomnity with a predetermined recognition 

w rule; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 

15 and varies with the lapse of time, in confomnity 

with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 
response to a result of voice recognition in con- 
20 formity with a predetermined response genera- 

tion rule based on at least said emotion 
parameter; 

response output means for outputting said 
response; and 

25 communication means for carrying out 

processing to acquire at least said emotion 
parameter or data necessary for generating 
said emotion parameter by connection to said 
predetermined network, 

30 wherein said response generation means gen- 

erates a response depending on said emotion 
parameter acquired by said communication 
means or a response depending on an emotion 
parameter generated from said data acquired 

35 by said communication means. 

5. The information processing apparatus according to 
claim 4, said apparatus characterized in that at 
least said emotion parameter or data required in 

40 generation of said emotion parameter can be 
updated by using data stored in replaceable record- 
ing media. 

6. A portable apparatus characterized in that said- 
45 apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
50 voice received from said voice input means and 

for outputting a result of voice recognition in 
confomnity with a predetermined recognition 
rule; 

emotion generation means for generating an 
55 emotion parameter, which indicates an emotion 

in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 
and varies with the lapse of time, in conformity 
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with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 
response to a result of voice recognition in con- 
formity with a predetermined response genera- 5 
tion rule based on at least said emotion 
parameter; 

response output means for outputting said 
response; and 

communication means for canrying out io 
processing to update said recognition rule, said 
emotion-parameter generation rule and said 
response generation rule by connection to a 
predetermined network; or 
communication means for carrying out is 
processing to update data necessary for said 
recognition rule, said emotion-parameter gen- 
eration rule and said response generation rule 
by connection to said predetennined network. 

20 

7. The portable apparatus according to claim 6, said 
apparatus characterized in that said communica- 
tion means periodically connects said portable 
apparatus to said network in order to carry out said 
update processing. 25 

8. The portable apparatus according to claim 6, said 
apparatus characterized in that at least said emo- 
tion parameter or data required in generation of 
said emotion parameter can be updated by using 30 
data stored In replaceable recording media. 

9. The portable apparatus characterized in that said 
apparatus comprises: 

35 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition in 40 
conformity with a predetermined recognition 
rule; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 45 
accordance with a result of voice recognition 
and varies with the lapse of time, in conformity 
with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 50 
response to a result of voice recognition in con- 
formity with a predetermined response genera- 
tion rule based on at least said emotion 
parameter; 

response output means for outputting said 55 
response; and 

communication means for canning out 
processing to acquire at least said emotion 



parameter or data necessary for generating 
said emotion parameter by connection to said 
predetennined network, 
wherein said response generation means gen- 
erates a response depending on said emotion 
parameter acquired by said communication 
means or a response depending on an emotion 
parameter generated from said data acquired 
by said comnnunication means. 

10. The portable apparatus according to claim 9, said 
apparatus characterized in that at least said emo- 
tion parameter or data required in generation of 
said emotion parameter can be updated by using 
data stored in replaceable recording media. 

11. An electronic pet apparatus characterized in that 
said apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition in 
conformity with a predetermined recognition 

rule; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 

and varies with the lapse of time, in conformity 
with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 
response to a result of voice recognition In con- 
formity with a predetermined response genera- 
tion rule based on at least said emotion 
parameter; 

response output means for outputting said 
response; and 

communication means for carrying out 
processing to update said recognition rule, said 
emotion-parameter generation rule and said 
response generation rule by connection to a 
predetennined network; or 
communication means for carrying out 
processing to update data necessary for said 
recognition rule, said emotion-parameter gen- 
eration rule and said response generation rule 
by connection to said predetennined network. 

12. The electronic pet apparatus according to claim 1 1 , 
said apparatus characterized in that said communi- 
cation means periodically connects said electronic 
pet apparatus to said network in order to cany out 
said update processing. 

13. The electronic pet apparatus according to claim 1 1 
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, said apparatus characterized in that at least said 
emotion parameter or data required in generation of 
said emotion parameter can be updated by using 
data stored in replaceable recording media. 

14. An electronic pet apparatus characterized in that 
said apparatus comprises: 

voice input means tor inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition in 
conformity with a predetermined recognition 
rule; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 
and varies with the lapse of time, in conformity 
with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 
response to a result of voice recognition in con- 
formity with a predetennined response genera- 
tion rule based on at least said emotion 
parameter; 

response output means for outputting said 
response; and 

communication means for carrying out 
processing to acquire at least said emotion 
parameter or data necessary for generating 
said emotion parameter by connection to said 
predetermined network, 
wherein said response generation means gen- 
erates a response depending on said emotion 
parameter acquired by said communication 
means or a response depending on an emotion 
parameter generated from said data acquired 
by said communication means. 

15. The electronic pet apparatus according to claim 14 

, said apparatus characterized in that at least said 
emotion parameter or data required in generation of 
said emotion parameter can be updated by using 
data stored in replaceable recording media. 

16. A recording medium storing information processing 
procedures characterized in that said procedure 
comprises: 

voice input sub-procedure for inputting a voice 
output by the user; 

voice recognition sub-procedure for recogniz- 
ing a voice received from said voice input 
means and for outputting a result of voice rec- 
ognition in conformity with a predetermined 
recognition rule; 



emotion generation sub-procedure for generat- 
ing an emotion parameter, which indicates an 
emotion in a pseudo manner as well as varies 
at least in accordance with a result of voice rec- 

5 ognition and varies with the lapse of time, in 

confomiity with a predetermined emotion- 
parameter generation rule; 
response generation sub-procedure for gener- 
ating a response to a result of voice recognition 

10 in conformity with a predetennined response 

generation rule based on at least said emotion 
parameter; 

response output sub-procedure for outputting 
said response; and 

15 communication sub-procedure for carrying out 

processing to update said recognition rule, said 
emotion-parameter generation rule and said 
response generation rule by connection to a 
predetermined network; or 

20 communication sub-procedure for carrying out 

processing to update data necessary for said 
recognition rule, said emotion-parameter gen- 
eration rule and said response generation rule 
by connection to said predetermined network. 

25 

17. A recording medium storing information processing 
procedures according to claim 16, said medium 
characterized in that said communication sub-pro- 
cedure periodically sets connection to said network 

30 in order to carry out said update processing. 

18. A recording medium storing infonnation processing 
procedures characterized in that said procedure 
comprises: 



voice input sub-procedure for inputting a voice 
output by the user; 

voice recognition sub-procedure for recogniz- 
ing a voice received from said voice input 
40 means and for outputting a result of voice rec- 

ognition in conformity with a predetermined 
recognition rule; 

emotion generation sub-procedure for generat- 
ing an emotion parameter, which indicates an 

45 emotion in a pseudo manner as well as varies 

at least in accordance with a result of voice rec- 
ognition and varies with the lapse of time, in 
confomnity with a predetermined emotion- 
parameter generation rule; 

50 response generation sub-procedure for gener- 

ating a response to a result of voice recognition 
in conformity with a predetermined response 
generation rule based on at least said emotion 
parameter; 

55 response output sub-procedure for outputting 

said response; and 

communication sub-procedure for carrying out 
processing to acquire said emotion parameter 
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or data necessary for generating said emotion 
parameter by connection to said predeter- 
mined network, 

whereby said response generation sub-proce- 
dure generates a response depending on said 5 
emotion parameter acquired by said communi- 
cation sub-procedure or a response depending 
on an emotion parameter generated from said 
data acquired by said communication sub-pro- 
cedure. 10 

19. An information processing metliod characterized in 
that said method comprises the steps of: 

inputting a voice output by the user; is 
recognizing a voice received from said voice 
input means and for outputting a result of voice 
recognition in conformity witli a predetennined 

recognition rule; 

generating an emotion parameter, which indi- 20 
cates an emotion in a pseudo manner as well 
as varies at least in accordance with a result of 
voice recognition and varies with the lapse of 
time, in confomiity with a predetennined emo- 
tion-parameter generation rule; 25 
generating a response to a result of voice rec- 
ognition in conformity with a predetermined 
response generation rule based on at least 
said emotion parameter; 

outputting said response; and 30 

carrying out communication processing to 
update said recognition rule, said emotion- 
parameter generation rule and said response 
generation rule by connection to a predeter- 
mined network; or 35 
canning out communication processing to 
update data necessary for said recognition 
rule, said emotion-parameter generation rule 
and said response generation rule by connec- 
tion to said predetennined network. 40 

20. The information processing method according to 
claim 19, said method characterized in that, at said 
step of carrying out update processing, connection 

to said network is set periodically in order to canry 45 
out said update processing. 

21. An information processing method characterized in 
that said method comprises the steps of: 

50 

inputting a voice output by the user; 
recognizing a voice received from said voice 
input means and for outputting a result of voice 
recognition in conformity with a predetennined 
recognition rule; 55 
generating an emotion parameter, which indi- 
cates an emotion in a pseudo manner as well 
as varies at least in accordance with a result of 



voice recognition and varies with the lapse of 
time, in confomnity with a predetermined emo- 
tion-parameter generation rule; 
generating a response to a result of voice rec- 
ognition in conformity with a predetermined 
response generation rule based on at least 
said emotion parameter; 
outputting said response; and 
carrying out communication processing to 
acquire said emotion parameter or data neces- 
sary for generating said emotion parameter by 
connection to said predetermined network, 
whereby, at said step of generating a response, 
there is output a response depending on said 
emotion parameter acquired at said step of car- 
rying out update processing or a response 
depending on an emotion parameter generated 
from said data acquired at said step of carrying 
out update processing. 

22. An information processing apparatus characterized 
in that said apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition; 
data base comprising a result of voice recogni- 
tion of a word included in a voice and the type 
of said word; 

response generation means for searching, in 
accordance with a result of voice recognition, 
said data base for the type of a word included 
in a voice represented by said result of voice 
recognition and for generating a response to 
said result of voice recognition in dependence 
of said type; 

response output means for outputting said 
response; and 

cataloging means capable of changing said 
data base in accordance with a voice repre- 
senting a word in a cataloging operation mode 
at least by cataloging a result of recognition of 
said word into said data base. 

23. The information processing apparatus according to 
claim 22, said apparatus characterized in that: 

said voice recognition means identifies a voice 
and outputs a result of voice recognition as a 
series of phonemes; and 
said cataloging means records a result of voice 
recognition of a word included in a voice and 
the type of said word according to a series of 
phonemes representing a result of voice recog- 
nition into said data base. 
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24. The information processing apparatus according to 
claim 22, said apparatus characterized in that said 
voice recognition means outputs a result of voice 
recognition as text data obtained as a result of con- 
version of a voice. 

25. The information processing apparatus according to 
claim 22, said apparatus characterized in that, in a 
cataloging operation mode, said voice recognition 
means carries out a voice recognition process by 
delimiting a voice by using predetemilned delimit- 
ers. 

26. The information processing apparatus according to 
claim 22, said apparatus characterized by further 
having an emotion generation means for generating 
a pseudo emotion parameter, which indicates an 
emotion in a pseudo manner as well as varies at 
least in accordance with a result of voice recogni- 
tion and varies with the lapse of time, In conformity 
with a predetermined pseudo-emotion-parameter 
generation rule, wherein said response generation 
means generates a response to a result of voice 
recognition in confomilty with a predetennined 
response generation rule taking at least said 
pseudo emotion parameter as a reference. 

27. The information processing apparatus according to 
claim 22, said apparatus characterized by being 
capable of exchanging at least said pseudo emo- 
tion parameter or data necessary for generation of 
said pseudo emotion parameter through replacea- 
ble recording media. 

28. A portable apparatus characterized in that said 
apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition; 
a data base comprising a result of voice recog- 
nition of a word included in a voice and the type 
of said word; 

response generation means for searching, in 
accordance with a result of voice recognition, 
said data base for the type of a word included 
in a voice represented by said result of voice 
recognition and for generating a response to 
said result of voice recognition in dependence 
of said type; 

response output means for outputting said 
response; and 

cataloging means capable of changing said 
data base In accordance with a voice repre- 
senting a word in a cataloging operation mode 
at least by cataloging a result of recognition of 



said word into said data base. 

29. The portable apparatus according to claim 28, said 
apparatus characterized In that: 

5 

said voice recognition means identifies a voice 
and outputs a result of voice recognition as a 
series of phonemes; and 
said cataloging means records a result of voice 
10 recognition of a word included in a voice and 

the type of said word according to a series of 
phonemes representing a result of voice recog- 
nition into said data base. 

15 30. The portable apparatus according to claim 28, said 
apparatus characterized in that said voice recogni- 
tion means outputs a result of voice recognition as 
text data obtained as a result of conversion of a 
voice. 

20 

31. The portable apparatus according to claim 28, said 
apparatus characterized in that, in a cataloging 
operation mode, said voice recognition means car- 
ries out a voice recognition process by delimiting a 

25 voice by using predetermined delimiters. 

32. The portable apparatus according to claim 28, said 
apparatus characterized by further having an emo- 
tion generation means for generating a pseudo 

30 emotion parameter, which indicates an emotion in a 
pseudo manner as well as varies at least in accord- 
ance with a result of voice recognition and varies 
with the lapse of time, in confomnity with a predeter- 
mined pseudo-emotion-parameter generation rule, 

35 wherein said response generation means gener- 
ates a response to a result of voice recognition in 
conformity with a predetermined response genera- 
tion rule taking at least said pseudo emotion param- 
eter as a reference. 

40 

33. The portable apparatus according to claim 28, said 
apparatus characterized by being capable of 
exchanging at least said pseudo emotion parame- 
ter or data necessary for generation of said pseudo 

45 emotion parameter through replaceable recording 
media. 

34. An electronic pet apparatus characterized in that 
said apparatus comprises: 

50 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
55 for outputting a result of voice recognition; 

a data base comprising a result of voice recog- 
nition of a word Included in a voice and the type 
of said word; 
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response generation means for searching, in 
accordance with a result of voice recognition, 
said data base (or the type of a word included 
in a voice represented by said result of voice 
recognition and for generating a response to $ 
said result of voice recognition in dependence 
of said type; 

response output means for outputting said 
response; and 

cataloging means capable of changing said 10 
data base in accordance with a voice repre- 
senting a word in a cataloging operation mode 
at least by cataloging a result of recognition of 
said word into said data base. 

15 

35. The electronic pet apparatus according to claim 34, 
said apparatus characterized in that: 

said voice recognition means identifies a voice 
and outputs a result of voice recognition as a 20 
series of phonemes; and 
said cataloging means records a result of voice 
recognition of a word included in a voice and 
the type of said word according to a series of 
phonemes representing a result of voice recog- 25 
nition into said data base. 

36. The electronic pet apparatus according to claim 34, 
said apparatus characterized in that said voice rec- 
ognition means outputs a result of voice recognition 30 
as text data obtained as a result of conversion of a 

voice. 

37. The electronic pet apparatus according to claim 34, 
said apparatus characterized In that, in a cataloging 35 
operation mode, said voice recognition means car- 
ries out a voice recognition process by delimiting a 
voice by using predetermined delimiters. 

38. The electronic pet apparatus according to claim 34, 40 
said apparatus characterized by further having an 
emotion generation means for generating a pseudo 
emotion parameter, which indicates an emotion in a 
pseudo manner as well as varies at least in accord- 
ance with a result of voice recognition and varies 45 
with the lapse of time, in conformity with a predeter- 
mined pseudo-emotion-parameter generation rule, 
wherein said response generation means gener- 
ates a response to a result of voice recognition in 
conformity with a predetermined response genera- so 
tion rule taking at least said pseudo emotion param- 
eter as a reference. 

39. The electronic pet apparatus according to claim 34, 
said apparatus characterized by being capable of 55 
exchanging at least said pseudo emotion parame- 
ter or data necessary for generation of said pseudo 
emotion parameter through replaceable recording 



media. 

40. A recording medium storing infomriation processing 
procedures characterized In that said procedure 
comprises: 

a voice input sub-procedure for inputting a 
voice output by the user; 
a voice recognition sub-procedure for recogniz- 
ing a voice received from said voice input sub- 
procedure and for outputting a result of voice 
recognition; 

a response generation sub-procedure for 
searching a data base comprising a result of 
voice recognition of a word included in a voice 
and the type of said word for the type of a spe- 
cific word included in a voice represented by a 
result of voice recognition and for generating a 
response to said result of voice recognition in 
dependence of the type of said specific word; 
a response output sub-procedure for outputting 
said response; and 

a cataloging sub-procedure capable of chang- 
ing said data base in accordance with a voice 
representing a word in a cataloging operation 
mode at least by cataloging a result of recogni- 
tion of said word into said data base. 

41. The recording medium storing information process- 
ing procedures according to claim 40, said medium 
characterized in that: 

said voice recognition sub-procedure identifies 
a voice and outputs a result of voice recognition 
as a series of phonemes; and 
said cataloging sub-procedure records a result 
of voice recognition of a word Included in a 
voice and the type of said word according to a 
series of phonemes representing a result of 
voice recognition into said data base. 

42. The recording medium storing information process- 
ing procedures according to claim 40, said medium 
characterized in that said voice recognition sub- 
procedure outputs a result of voice recognition as 
text data obtained as a result of conversion of a 
voice. 

43. The recording medium storing information process- 
ing procedures according to claim 40, said medium 
characterized in that, in a cataloging operation 
mode, said voice recognition sub-procedure carries 
out a voice recognition process by delimiting a 
voice by using predetemiined delimiters. 

44. The recording medium storing infonnation process- 
ing procedures according to claim 40, said medium 
characterized in that said procedure further 
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includes an emotion generation sub-procedure for 
generating a pseudo emotion parameter, which 
indicates an emotion in a pseudo manner as well as 
varies at least in accordance with a result of voice 
recognition and varies with the lapse of time, in con- 
formity with a predetennined pseudo-emotion- 
parameter generation rule, and said response gen- 
eration sub-procedure generates a response to a 
result of voice recognition in conformity with a pre- 
determined response generation rule tailing at least 
said pseudo emotion parameter as a reference. 

45. An information processing method characterized in 
that said method comprises the steps of: 

inputting a voice output by the user; 
recognizing a voice input at said step of input- 
ting a voice and for outputting a result of voice 

recognition; 

searching a data base comprising a result of 
voice recognition of a word included in a voice 
and the type of said word for the type of a spe- 
cific word included in a voice represented by a 
result of voice recognition and generating a 
response to said result of voice recognition in 
dependence of the type of said specific word; 
outputting said response; and 
changing said data base in accordance with a 
voice representing a word in a cataloging oper- 
ation mode at least by cataloging a result of 
recognition of said word into said data base. 

46. The information processing method according to 
claim 45, said method characterized in that: 

at said step of recognizing a voice, a voice is 
identified and a result of voice recognition is 
output as a series of phonemes; and 
at said step of changing said data base, 
according to a series of phonemes represent- 
ing a result of voice recognition, a result of 
voice recognition of a word included in a voice 
and the type of said word are recorded into said 
data base. 

47. The information processing method according to 
claim 45, said method characterized in that, at said 
step of recognizing a voice, a result of voice recog- 
nition is output as text data obtained as a result of 
conversion of a voice. 

48. The information processing method according to 
claim 45, said method characterized in that, in a 
cataloging operation mode, at said step of recog- 
nizing a voice, a voice recognition process is car- 
ried out by delimiting a voice by using 
predetermined delimiters. 



49. The Information processing method according to 
claim 45, said method characterized by further 
including the step of generating a pseudo emotion 
parameter, which indicates an emotion in a pseudo 

5 manner as well as varies at least in accordance 
with a result of voice recognition and varies with the 
lapse of time, in conformity with a predetermined 
pseudo-emotion-parameter generation rule charac- 
terized in that, at said step of generating a 

10 response, a response to a result of voice recogni- 
tion is generated in conformity with a predeter- 
mined response generation rule taking at least said 
pseudo emotion parameter as a reference. 

15 50. An infonmation processing apparatus characterized 
in that said apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

20 voice recognition means for recognizing a 

voice received from said voice input means and 
for outputting a result of voice recognition; 
response generation means for generating a 
response to a result of voice recognition in con- 

25 formity with a predetermined response genera- 

tion rule; 

response output means for outputting said 
response; and 

user authentication means for authenticating 
30 the user on the basis of a voice output by said 

user, 

wherein said response generation means gen- 
erates a response to a person entering a voice 
with said response varied in dependence on a 
35 result of authentication produced by said user 

authentication means. 

51. The information processing apparatus according to 
claim 50, said apparatus characterized in that said 
40 user authentication means forms a judgment on a 
result of voice recognition with a past result of voice 
recognition used as a reference and authenticates 
the user on the basis of a result of said judgment. 

45 52. The information processing apparatus according to 
claim 51 , said apparatus characterized in that: 

said response generation means raises a 
query about a past result of voice recognition 

50 as a response; and 

said user authentication means fomns a judg- 
ment on a result of voice recognition of a 
response to said query in order to authenticate 
the user. 

55 

53. The information processing apparatus according to 

claim 51, said apparatus characterized in that said 
past result of voice recognition is a predetermined 
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word. 

54. The information processing apparatus according to 
claim SO, said apparatus characterized in that said 
user authentication means detects a characteristic 
quantity representing a characteristic of a voice out- 
put by the user from a result of voice recognition 
and authenticates said user on the basis of said 
characteristic quantity. 

55. The infonnation processing apparatus according to 
claim 50, said apparatus further having an emotion 
generation means for generating a pseudo emotion 
parameter, which indicates an emotion in a pseudo 
manner as well as varies at least in accordance 
with a result of voice recognition and varies with the 
lapse of time, in conformity with a predetermined 
pseudo-emotion-parameter generation rule, 
wherein said response generation means gener- 
ates a response to a result of voice recognition in 
conformity with a predetermined response genera- 
tion rule taking at least said pseudo emotion param- 
eter as a reference. 

56. A portable processing apparatus characterized in 
that said apparatus comprises: 

a voice input means for inputting a voice output 
by the user; 

a voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition; 
a response generation means for generating a 
response to a result of voice recognition in con- 
formity with a predetermined response genera- 
tion rule; 

a response output means for outputting said 
response; and 

a user authentication means for authenticating 
the user on the basis of a voice output by said 
user, 

wherein said response generation means gen- 
erates a response to a person entering a voice 
with said response varied in dependence on a 
result of authentication produced by said user 
authentication means. 

57. The portable processing apparatus according to 
claim 56, said apparatus characterized in that said 
user authentication means fonns a judgment on a 
result of voice recognition with a past result of voice 
recognition used as a reference and authenticates 
the user on the basis of a result of said judgment. 

58. The portable processing apparatus according to 
claim 57, said apparatus characterized in that: 

said response generation means raises a 



query about a past result of voice recognition 
as a response; and 

said user authentication means forms a judg- 
ment on a result of voice recognition of a 
5 response to said query in order to authenticate 

the user. 

59. The portable processing apparatus according to 
claim 57, said apparatus characterized in that said 

10 past result of voice recognition is a predetermined 
word. 

60. The portable processing apparatus according to 
claim 56, said apparatus characterized in that said 

15 user authentication means detects a characteristic 
quantity representing a characteristic of a voice out- 
put by the user from a result of voice recognition 
and authenticates said user on the basis of said 
characteristic quantity. 

20 

61. The portable processing apparatus according to 
claim 56, said apparatus characterized by further 
having an emotion generation means for generating 
a pseudo emotion parameter, which indicates an 

25 emotion in a pseudo manner as well as varies at 
least in accordance with a result of voice recogni- 
tion and varies with the lapse of time, in conformity 
with a predetennined pseudo-emotion-parameter 
generation rule, wherein said response generation 

30 means generates a response to a result of voice 
recognition in conformity with a predetermined 
response generation rule taking at least said 
pseudo emotion parameter as a reference. 

35 62. An electronic pet apparatus characterized in that 
said apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

40 voice recognition means for recognizing a 

voice received from said voice Input means and 
for outputting a result of voice recognition; 
response generation means for generating a 
response to a result of voice recognition in con- 

45 formity with a predetermined response genera- 

tion rule; 

response output means for outputting said 
response; and 

user authentication means for authenticating 
50 the user on the basis of a voice output by said 

user, 

wherein said response generation means gen- 
erates a response to a person entering a voice 
with said response varied in dependence on a 
55 result of authentication produced by said user 

authentication means. 

63. The electronic pet apparatus according to claim 62, 
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said apparatus characterized in that said user 
authentication means fonns a judgment on a result 
of voice recognition with a past result of voice rec- 
ognition used as a reference and authenticates the 
user on the basis of a result of said judgment. 5 

64. The electronic pet apparatus according to claim 63, 
said apparatus characterized in that: 

said response generation means raises a w 
query about a past result of voice recognition 

as a response; and 

said user authentication means forms a judg- 
ment on a result of voice recognition of a 
response to said query in order to authenticate 15 
the user. 

65. The electronic pet apparatus according to claim 63, 
said apparatus characterized in that said past result 

of voice recognition is a predetermined word. 20 

66. The electronic pet apparatus according to claim 62, 
said apparatus characterized in that said user 
authentication means detects a characteristic 
quantity representing a characteristic of a voice out- 25 
put by the user from a result of voice recognition 
and authenticates said user on the basis of said 
characteristic quantity. 

67. The electronic pet apparatus according to claim 62, 30 
said apparatus characterized by further having an 
emotion generation means for generating a pseudo 
emotion parameter, which indicates an emotion in a 
pseudo manner as well as varies at least in accord- 
ance with a result of voice recognition and varies 55 
with the lapse of time, in conformity with a predeter- 
mined pseudo-emotion-parameter generation rule, 
wherein said response generation means gener- 
ates a response to a result of voice recognition in 
confonnity with a predetermined response genera- 40 
tion rule taking at least said pseudo emotion param- 
eter as a reference. 

68. A recording medium storing information processing 
procedures characterized in that said procedure 45 
comprises: 

a voice Input sub-procedure for inputting a 
voice output by the user; 

a voice recognition sub-procedure for recogniz- 50 
ing a voice received from said voice input sub- 
procedure and for outputting a result of voice 
recognition; 

a response generation sub-procedure for gen- 
erating a response to a result of voice recogni- 55 
tion in confonnity with a predetennined 
response generation rule; 
a response output sub-procedure for outputting 
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said response; and 

a user authentication sub-procedure for 
authenticating the user on the basis of a voice 
output by said user, 

whereby said response generation sub-proce- 
dure generates a response to a person enter- 
ing a voice with said response varied in 
dependence on a result of authentication pro- 
duced by said user authentication sub-proce- 
dure. 

69. The recording medium used for storing information 
processing procedures according to claim 68, said 
medium characterized in that said user authentica- 
tion sub-procedure fonms a judgment on a result of 
voice recognition with a past result of voice recogni- 
tion used as a reference and authenticates the user 
on the basis of a result of said judgment. 

70. The recording medium storing infonnation process- 
ing procedures according to claim 69, said medium 
characterized in that: 

said response generation sub-procedure 
raises a query about a past result of voice rec- 
ognition as a response; and 
said user authentication sub-procedure forms a 
judgment on a result of voice recognition of a 
response to said query in order to authenticate 
the user 

71. The recording medium storing information process- 
ing procedures according to claim 69, said medium 
characterized in that said past result of voice recog- 
nition Is a predetermined word. 

72. The recording medium storing information process- 
ing procedures according to claim 68, said medium 
characterized in that said user authentication sub- 
procedure detects a characteristic quantity repre- 
senting a characteristic of a voice output by the 
user from a result of voice recognition and authenti- 
cates said user on the basis of said characteristic 
quantity. 

73. The recording medium storing information process- 
ing procedures according to claim 68, said medium 
characterized in that said procedure further has an 
emotion generation sub-procedure for generating a 
pseudo emotion parameter, which indicates an 
emotion in a pseudo manner as well as varies at 
least in accordance with a result of voice recogni- 
tion and varies with the lapse of time, in confonnity 
with a predetennined pseudo-emotion-parameter 
generation rule, wherein said response generation 
sub-procedure generates a response to a result of 
voice recognition in conformity with a predeter- 
mined response generation rule taking at least said 
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pseudo emotion parameter as a reference. 

74. An information processing method characterized in 
that said method comprises the steps of: 

inputting a voice output by the user; 
recognizing a voice Input at said step of input- 
ting a voice and outputting a result of voice rec- 
ognition; 

generating a response to a result of voice rec- 
ognition in conformity with a predetennined 

response generation rule; 
outputting said response; and 
authenticating the user on the basis of a voice 
output by said user, 

whereby, at said step of generating a response, 
a response to a person entering a voice is gen- 
erated, being varied in dependence on a result 
of authentication produced at said step of 

authenticating the user. 

75. The information processing method according to 
claim 74, said method characterized in that, at said 
step of authenticating the user, a judgment on a 
result of voice recognition is formed with a past 
result of voice recognition used as a reference and 
said user is authenticated on the basis of a result of 
said judgment. 

76. The information processing method according to 
claim 75, said method characterized in that: 

at said step of generating a response, a query 
about a past result of voice recognition is 
raised as a response; and 
at said step of authenticating the user, a judg- 
ment on a result of voice recognition of a 
response to said query is formed in order to 
authenticate said user. 

77. The information processing method according to 
claim 75, said method characterized in that said 
past result of voice recognition is a predetermined 
word. 

78. The information processing method according to 
claim 74, said method characterized in that, at said 
step of authenticating the user, a characteristic 
quantity representing a characteristic of a voice out- 
put by said user is detected from a result of voice 
recognition and said user is authenticated on the 
basis of said characteristic quantity. 

79. The information processing method according to 
claim 74, said method characterized by further hav- 
ing a step of generating a pseudo emotion parame- 
ter, which indicates an emotion in a pseudo manner 
as well as varies at least in accordance with a result 



of voice recognition and varies with the lapse of 
time, in conformity with a predetermined pseudo- 
emotion-parameter generation rule, whereby, at 
said step of generating a response, a response to a 
5 result of voice recognition is generated in conform- 
ity with a predetermined response generation rule 
talcing at least said pseudo emotion parameter as a 
reference. 

10 80, An infomiation processing apparatus characterized 
in that said apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

15 voice recognition means for recognizing a 

voice received from said voice input means and 
for outputting a result of voice recognition; 
response generation means for generating a 

response to a result of voice recognition in con- 
20 formity with a predetermined response genera- 

tion rule; 

response output means for outputting said 
response; and 

word/phrase classification means for identify- 
25 ing the type of an Input represented by a voice 

on the basis of said voice, 
wherein said response generation rule is a rule 
of generating responses excluding a response 
of a predetennined type in accordance with the 
30 type of an input and a category of a response 

to said input. 

81. The information processing apparatus according to 
claim 80, said apparatus characterized by further 

35 having a history recording means used for storing a 
history of at least types of inputs each entered as a 
voice and categories of responses to said inputs 
generated by said response generation means, 
wherein said response generation means gener- 
ic ates a response by referring to said history stored in 
said history recording means. 

82. The information processing apparatus according to 
claim 80, said apparatus characterized by further 

45 having an emotion generation means for generating 
a pseudo emotion parameter, which indicates an 
emotion in a pseudo manner as well as varies at 
least in accordance with a result of voice recogni- 
tion and varies with the lapse of time, in conformity 

50 with a predetennined pseudo-emotion-parameter 
generation rule, wherein said response generation 
means generates a response to a result of voice 
recognition in conformity with a predetermined 
response generation rule taking at least said 

55 pseudo emotion parameter as a reference. 

83. A portable apparatus characterized in that said 
apparatus comprises: 
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voice Input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition; 
response generation means for generating a 

response to a result of voice recognition in con- 
formity with a predetermined response genera- 
tion rule; 

response output means for outputting said 
response; and 

word/phrase classification means for identify- 
ing the type of an input represented by a voice 
on the basis of said voice, 
wherein said response generation rule is a rule 
of generating responses excluding a response 
of a predetermined type in accordance with the 
type of an input and a category of a response 
to said input. 

84. The portable apparatus according to claim 83, said 
apparatus characterized by further having a history 
recording means used for storing a history of at 
least types of inputs each entered as a voice and 
categories of responses to said inputs generated by 
said response generation means, wherein said 
response generation means generates a response 
by referring to said history stored in said history 
recording means. 

85. The portable apparatus according to claim 83, said 
apparatus characterized by further having an emo- 
tion generation means for generating a pseudo 
emotion parameter, which indicates an emotion in a 
pseudo manner as well as varies at least in accord- 
ance with a result of voice recognition and varies 
with the lapse of time, in conformity with a predeter- 
mined pseudo-emotion-parameter generation rule, 
wherein said response generation means gener- 
ates a response to a result of voice recognition in 
confomiity with a predetermined response genera- 
tion rule taking at least said pseudo emotion param- 
eter as a reference. 

86. An electronic pet apparatus characterized in that 
said apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means and 
for outputting a result of voice recognition; 
response generation means for generating a 
response to a result of voice recognition in con- 
formity with a predetennined response genera- 
tion rule; 

response output means for outputting said 
response; and 



word/phrase classification means for identify- 
ing the type of an input represented by a voice 
on the basis of said voice, 
wherein said response generation rule is a rule 
5 of generating responses excluding a response 

of a predetennined type in accordance with the 
type of an input and a category of a response 
to said input. 

10 87. The electronic pet apparatus according to claim 86, 
said apparatus characterized by further having a 
history recording means used for storing a history 
of at least types of Inputs each entered as a voice 
and categories of responses to said Inputs gener- 

15 ated by said response generation means, wherein 
said response generation means generates a 
response by referring to said history stored in said 
history recording means. 

20 88. The electronic pet apparatus according to claim 86, 
said apparatus characterized by further having an 
emotion generation means for generating a pseudo 
emotion parameter, which indicates an emotion in a 
pseudo manner as well as varies at least in accord- 

25 ance with a result of voice recognition and varies 
with the lapse of time, In conformity with a predeter- 
mined pseudo-emotion-parameter generation rule, 
wherein said response generation means gener- 
ates a response to a result of voice recognition in 

30 conformity with a predetermined response genera- 
tion rule taking at least said pseudo emotion param- 
eter as a reference. 

89, A recording medium storing information processing 
35 procedures characterized in that said procedure 
comprises: 

a voice Input sub-procedure for inputting a 
voice output by the user; 
40 a voice recognition sub-procedure for recogniz- 

ing a voice received from said voice input sub- 
procedure and for outputting a result of voice 
recognition; 

a response generation sub-procedure for gen- 
45 erating a response to a result of voice recogni- 

tion in conformity with a predetermined 

response generation rule; 

a response output sub-procedure for outputting 

said response; and 
50 a word/phrase classification sub-procedure for 

Identifying the type of an input represented by a 

voice on the basis of said voice, 

wherein said response generation rule is a rule 

of generating responses excluding a response 
55 of a predetermined type in accordance with the 

type of an input and a category of a response 

to said input. 
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90. The recording medium storing information process- 
ing procedures according to claim 89, said medium 
characterized in that said procedure further has a 
history recording sub-procedure for storing a his- 
tory of at least types of inputs each entered as a 5 
voice and categories of responses to said inputs 
generated by said response generation sub-proce- 
dure, wherein said response generation sub-proce- 
dure generates a response by referring to said 
history stored by said history recording sub-proce- 
dure. 

91. The recording medium storing information process- 
ing procedures according to claim 89, said medium 
characterized in that said procedure further has an 
emotion generation sub-procedure for generating a 
pseudo emotion parameter, which Indicates an 
emotion in a pseudo manner as well as varies at 
least in accordance with a result of voice recogni- 
tion and varies with the lapse of time, in conformity 
with a predetermined pseudo-emotion-parameter 
generation rule, wherein said response generation 
sub-procedure generates a response to a result of 
voice recognition in conformity with a predeter- 
mined response generation rule taking at least said 
pseudo emotion parameter as a reference. 

92. An information processing method characterized in 
that said method comprises the steps of: 

inputting a voice output by the user; 
recognizing a voice input at said step of input- 
ting a voice and outputting a result of voice rec- 
ognition; 

generating a response to a result of voice rec- 
ognition in conformity with a predetermined 
response generation rule; 
outputting said response; and 
identifying the type of an input represented by a 
voice on the basis of said voice, 
whereby said response generation rule is a rule 
of generating responses excluding a response 
of a predetermined type in accordance with the 
type of an input and a category of a response 
to said input. 

93. The information processing method according to 
claim 92, said method characterized by further hav- 
ing the step of storing a history of at least types of 
inputs each entered as a voice and categories of 
responses to said inputs generated at said step of 
generating a response, whereby at said step of 
generating a response, a response is generated by 
referring to said history stored at said step of stor- 
ing a history. 

94. The information processing method according to 
claim 92, said method characterized by further hav- 



ing the step of generating a pseudo emotion param- 
eter, which indicates an emotion in a pseudo 
manner as well as varies at least in accordance 
with a result of voice recognition and varies with the 
lapse of time, in conformity with a predetermined 
pseudo-emotion-parameter generation rule, 
whereby at said step of generating a response, a 
response to a result of voice recognition is gener- 
ated in confonnity with a predetennined response 
generation rule taking at least said pseudo emotion 
parameter as a reference. 

95. An information processing apparatus characterized 
in that said apparatus comprises: 

voice input means for inputting a voice output 
by the user; 

voice recognition means for recognizing a 
voice received from said voice input means in 
confonnity with a predetermined recognition 
rule and for outputting a result of voice recogni- 
tion; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 
and varies with the lapse of time, in conformity 
with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 

response to a result of voice recognition in con- 
formity with a predetermined response genera- 
tion rule taking at least said emotion parameter 
as a reference; and 

response output means for outputting said 
response, 

wherein said emotion generation means has a 
history recording means used for recording a 
history of at least said emotion parameter 
along with a result of voice recognition corre- 
sponding to said emotion parameter, and a var- 
iation in said emotion parameter according to a 
result of voice recognition is changed in 
accordance with said history. 

96. The information processing apparatus according to 
claim 95, said apparatus characterized in that: 

said emotion generation means changes said 
emotion parameter in accordance with a word 
which is included in an input voice and excites 
an emotion; and 

when a specific word other than said word 
exciting an emotion is used at the same time as 
said word exciting an emotion and as many 
times as said word exciting an emotion is, said 
specific word used at the same time as said 
word exciting an emotion and as many times as 
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said word exciting an emotion also causes said 
emotion parameter to be changed as said word 
exciting an emotion does. 

97. The infonnation processing apparatus according to 5 
claim 95, said apparatus characterized in that: 

said emotion generation means changes said 
emotion parameter in accordance with a word 
which is included in an input voice and excites w 
an emotion; and 

when a particular word exciting an emotion is 
used frequently, a variation in said emotion 
parameter caused by said particular word 
exciting an emotion is reduced. is 

98. A portable apparatus characterized in that said 
apparatus comprises: 

voice input means for inputting a voice output 20 
by the user; 

voice recognition means for recognizing a 
voice received from said voice Input means in 
conformity with a predetermined recognition 
rule and for outputting a result of voice recogni- 25 

tion; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 30 

and varies with the lapse of time, in conformity 
with a predetermined emotion-parameter gen> 
eration rule; 

response generation means for generating a 
response to a result of voice recognition in con- 35 
formity with a predetermined response genera- 
tion rule taking at least said emotion parameter 
as a reference; and 

response output means for outputting said 
response, 4o 
wherein said emotion generation means has a 
history recording means used for recording a 
history of at least said emotion parameter 
along with a result of voice recognition corre- 
sponding to said emotion parameter, and a var- 45 
iation in said emotion parameter according to a 
result of voice recognition is changed in 
accordance with said history. 

99. The portable processing apparatus according to so 
claim 98, said apparatus characterized in that: 



said word exciting an emotion and as many 
times as said word exciting an emotion is, said 
specific word used at the same time as said 
word exciting an emotion and as many times as 
said word exciting an emotion also causes said 
emotion parameter to be changed as said word 
exciting an emotion does. 

100. The portable apparatus according to claim 98, said 
apparatus characterized in that: 

said emotion generation means changes said 
emotion parameter in accordance with a word 
which is included in an input voice and excites 
an emotion; and 

when a particular word exciting an emotion is 
used frequently, a variation in said emotion 
parameter caused by said particular word 
exciting an emotion is reduced. 

101. An electronic pet apparatus characterized in that 
said apparatus comprises: 

voice input means for inputting a voice output 

by the user; 

voice recognition means for recognizing a 
voice received from said voice input means in 
confomnity with a predetermined recognition 
rule and for outputting a result of voice recogni- 
tion; 

emotion generation means for generating an 
emotion parameter, which indicates an emotion 
in a pseudo manner as well as varies at least in 
accordance with a result of voice recognition 
and varies with the lapse of time, in confomnity 
with a predetermined emotion-parameter gen- 
eration rule; 

response generation means for generating a 
response to a result of voice recognition in con- 
formity with a predetermined response genera- 
tion rule taking at least said emotion parameter 
as a reference; and 

response output means for outputting said 
response, 

wherein said emotion generation means has a 
history recording means used for recording a 
history of at least a result of voice recognition 
and said emotion parameter corresponding to 
said result of voice recognition, and a variation 
in said emotion parameter according to said 
result of voice recognition is changed in 
accordance with said history. 

102. The electronic pet apparatus according to claim 
101 , said apparatus characterized in that: 

said emotion generation means changes said 
emotion parameter in accordance with a word 



said emotion generation means changes said 
emotion parameter in accordance with a word 
which Is included in an input vorce and excites ss 
an emotion; and 

when a specific word other than said word 
exciting an emotion is used at the same time as 
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which is included in an input voice and excites 
an emotion; and 

when a specific word other than said word 
exciting an emotion is used at the same time as 
said word exciting an emotion and as many 
times as said word exciting an emotion is, said 
specific word used at the same time as said 
word exciting an emotion and as many times as 
said word exciting an emotion also causes said 
emotion parameter to be changed as said word 
exciting an emotion does. 

103. The electronic pet apparatus according to claim 
1 01 , said apparatus characterized in that: 

said emotion generation means changes said 
emotion parameter in accordance with a word 
which is included in an input voice and excites 
an emotion; and 

when a particular word exciting an emotion is 
used frequently, a variation in said emotion 
parameter caused by said particular word 
exciting an emotion is reduced. 

104. A recording medium storing information processing 
procedures characterized in that said procedure 

comprises: 

a voice input sub-procedure for inputting a 
voice output by the user; 

a voice recognition sub-procedure for recogniz- 
ing a voice received from said voice input sub- 
procedure in conformity with a predetennined 
recognition rule and for outputting a result of 
voice recognition; 

an emotion generation sub-procedure for gen- 
erating an emotion parameter, which indicates 
an emotion in a pseudo manner as well as var- 
ies at least in accordance with a result of voice 
recognition and varies with the lapse of time, in 
conformity with a predetennined emotion- 
parameter generation rule; 
a response generation sub-procedure for gen- 
erating a response to a result of voice recogni- 
tion in confomnity with a predetermined 
response generation rule taking at least said 
emotion parameter as a reference; and 
a response output sub-procedure for outputting 
said response, 

wherein said emotion generation sub-proce- 
dure has a history recording sub-procedure for 
recording a history of at least a result of voice 
recognition and said emotion parameter corre- 
sponding to said result of voice recognition, 
and a variation in said emotion parameter 
according to said result of voice recognition is 
changed in accordance with said history. 



105.A recording medium storing information processing 
procedures according to claim 104, said medium 
characterized in that: 

5 said emotion generation sub-procedure 

changes said emotion parameter in accord- 
ance with a word which Is included in an input 
voice and excites an emotion; and 
when a specific word other than said word 

10 exciting an emotion is used at the same time as 

said word exciting an emotion and as many 
times as said word exciting an emotion is, said 
specific word used at the same time as said 
word exciting an emotion and as many times as 

15 said word exciting an emotion also causes said 

emotion parameter to be changed as said word 
exciting an emotion does. 

108.A recording medium storing information processing 
20 procedures according to claim 104, said medium 
characterized in that said: 

said emotion generation sub-procedure 
changes said emotion parameter in accord- 

25 ance with a word which is included in an input 

voice and excites an emotion; and 
when a particular word exciting an emotion is 
used frequently, a variation in said emotion 
parameter caused by said particular word 

30 exciting an emotion is reduced. 

107.An information processing method characterized in 
that said method comprises the steps of: 

35 inputting a voice output by the user; 

recognizing a voice input at said voice input 
sub-procedure in confomiity with a predeter- 
mined recognition rule and outputting a result 
of voice recognition; 

40 generating an emotion parameter, which indi- 

cates an emotion in a pseudo manner as well 
as varies at least in accordance with a result of 
voice recognition and varies with the lapse of 
time, in confonnity with a predetermined emo- 

45 tion-parameter generation rule; 

generating a response to a result of voice rec- 
ognition in conformity with a predetermined 
response generation rule taking at least said 
emotion parameter as a reference; and 

50 outputting said response, 

whereby said step of generating a response 
has the sub-step of recording a history of at 
least a result of voice recognition and said 
emotion parameter corresponding to said 

55 result of voice recognition, and a variation in 

said emotion parameter according to a result of 
voice recognition is changed in accordance 
with said history. 
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108. The information processing niethod according to 
claim 1 07, said method cliaracterized in that: 

at said step of generating an emotion parame- 
ter, said emotion parameter is changed in 5 
accordance with a word which is included in an 
input voice and excites an emotion; and 
when a specific word other than said word 
exciting an emotion is used at the same time as 
said word exciting an emotion and as many io 
times as said word exciting an emotion is, said 
specific word used at the same time as said 
word exciting an emotion and as many times as 
said word exciting an emotion also causes said 
emotion parameter to be changed as said word is 
exciting an emotion does. 

109. The information processing method according to 
claim 107, said method characterized in that: 



at said step of generating an emotion parame- 
ter, said emotion parameter is changed in 
accordance with a word which is included in an 
input voice and excites an emotion; and 
when a particular word exciting an emotion is 25 
used frequently, a variation in said emotion 
parameter caused by said particular word 
exciting an emotion is reduced. 



20 



30 



35 



40 



45 



50 



55 



35 



EP 1 072 297 A1 




lie 



RECOGNITION 
DATA 



•5* VOICE 
-^RECOGNITION 
MODULE 



TEXT 



11F 



DB 



VOICE 
SYNTHESIS 
MODULE 



11E 
RESPONSE 



16H 



VOICE DATA* 



DV 




PICTURE 
SYNTHESIS 
MODULE 




^ PHYSICAL-CONDITION 
^ CHANGING MODULE 



PHYSICAL-CONDITION 
DATA 

I6B 



1IB 



TIMER 



IID 



EMOTION 
-H CHANGING 
MODULE 



CHARACTER DATA 
16D 



CURRENT-EMOTION 
DATA 
16C 



RESPONSE-SENTENCE 
CREATION MODULE 




NETWORK CONNECTION UNIT 



KN^N^OKVWfflOHXPATTBWOATA 



-OT 



PICTURE DATA 



1:ELECTR0HIC PET 
— APPARATUS 



FIG. 1 



36 



EP 1 072 297 A1 




FIG. 2 



37 




38 



EP1 072 297A1 



VARIABLE NAME 


VALUE 


FATIGUE 


2 2 


HUNGER 


1 0 


THIRST INESS 


5 


SICKNESS 


1 


SLEEPINESS 


3 



16B 



FIG. 4 



VARIABLE NAME 


VALUE 


ANGER 


2 5 


SADNESS 


1 0 


JOY 


3 0 


FEAR 


8 


SURPRISE 


8 


HATRED 


3 



FIG. 5 



39 



EP 1 072 297 A1 







KEYWORD 






GOOD 


BAD 


HEY 


DIRTY 


VAR 
VARI 


ANGER 


- 1 


+ 1 0 


+ 5 


+ 5 


SADNESS 


+ 2 


+ 1 0 


+ 5 


+ 5 




JOY 


+ 20 


- 1 0 


- 1 0 


- 1 5 


fin Q 


FEAR 


-5 


+ 5 


+ 1 0 


-5 


il 


SURPRISE 


+ 5 


- 1 


+ 1 0 


+ 5 


m « 


HATRED 


- 1 


+ 5 


+ 2 


+ 2 0 



FIG. 6 



VARIABLE NAME 


VALUE 


ANGER 


2 4 


SADNESS 


1 2 


JOY 


5 0 


FEAR 


3 


SURPRISE 


1 3 


HATRED 


2 



FIG. 7 



40 




41 




EP 1 072 297 A1 



16H 



RESPONSE SENTENCE 


VOICE-FILE NAME 


1 LOVE YOU. TOO 


voice0001.wav 


yKW. 1 AM A MALE THOUGH 


voice0002.wav 


SHUT UP 


voice0003. wav 


WHAT? 


voice0004.wav 


HOWDY 


voiceOOOS. wav 


1 AM SURPRISE 


voice0006.wav 


HI 


voice0007.wav 


DID YOU CALL ME? 


voice0008.wav 



FIG. 9 



161 



RESPONSE SENTENCE 


PICTURE-FILE NAME 


1 LOVE YOU. TOO 


flgOOOl.bmp 


wot, 1 AM A MALE THOUGH 


flg0002.birp 


SHUT UP 


f igOOOlbirp 


WHAT? 


fig0004.biiip 


HOWDY 


figOOOS.bmp 


t AM SURPRISE 


figOOOe.bmp 


HI 


figOOOT.bnp 


DID YOU CALL IE? 


fig0008.bnp 



FIG. 10 



42 




EP 1 072 297 A1 



C START h -SPI 







ACCB'T A REQUEST FOR CONNECTIOl{ 






ESTABLISH A COHIUNiCATION 






TRANSFER DATA 






DISCONTINUE THE COMMUNICATION 



-SP3 



'SP4 



'SP5 



( END ^ -spe 
FIG. 11 



HEADER h^DT 
PAHERM DATA 



RECOGNITION DATA 
VOICE DATA 
PICTURE DATA 



FIG. 12 



43 



EP 1 072 297 A1 



RECOGNITION DATA 
--^16A 



VOICE 
RECOGNITION 
MODULE 



1U 



CATALOGING 
MODULE 



111 



RESPONSE 



110 



PHYSICAL-CONDITION 
DATA 



PHYSICAL-CONDITION 
CHANGING MODULE 




16B 



TIMER 



'^IIB 



CHARACTER DATA 
^^^^^16D 



EMOTION CHANGING 
MODULE 



/^IID 



\CURREI 



CURRENT-EMOTION 
DATA 

^16C 



RESPONSE-SENTENCE 
CREATION MODULE 




'11E 




knohledgeCconversation pattern 
base j history data 

16F 



DATA 



l:ELECTRONIC PET 
~" APPARATUS 



FIG. 13 



44 




EP1 072 297A1 



PHONEME 




PHONEME 



FIG. 14 



45 



EP 1 072 297 A1 



C START )~SP11 



START A CATALOGING MODE 



I 



'SP12 



ACCEPT A VOICE 

I 



RECOGNIZE A VOICE TO 
BE CATALOGED 



i 



PRESENT A RESULT 
OF RECOGNITION 



RECEIVE A CONFIRMATION 
INPUT 



'SP13 
'SP14 

'SP15 
'SP16 




ACCEPT A WORD ATTRIBUTE 



RECOGNIZE A VOICE FOR 
A CONVERSATION 



PRESENT A RESULT 
OF RECOGNITION 



RECEIVE A CONFIRMATION 
INPUT 



'SP18 
'SP19 

'SP20 

'SP21 




RECORD THE WORD AND 
THE ATTRIBUTE 



3 



'SP23 



( END ) — ^SP24 



FIG. 15 



46 



EP1 072 297A1 



RECOGNITION DATA 
't6A 




r 



DA 



VOICE 
RECOGNITION 
MODULE 



VOICE 
ijlAUTHENTICATIOI 
MODULE 




AUTHENTI- AUTHENTI-^ 
CATION CATION 
DATA STATE 




RESPONSE 



lie 



PHYSICAL-CONDITION 
DATA 



PHYSICAL-CONDITION 
CHANGI NG MODULE 

HE 



- TIMER 



CHARACTER DATA 
-16D 




EMOTION CHANGING 
MODULE 



^IID 



CURRENT-EMOTION 
DATA 



16C 




RESPONSE-SENTENCE 
CREATION MODULE 




t^llE 



KNOWLEDGE 
BASE 



CONVERSATION PATTERN 
HISTORY DATA 



1: ELECTRONIC PET 
~ APPARATUS 



FIG. 16 



47 




EP 1 072 297 A1 



us 



LU 



i = 



a. 

>- 

LU 



00 



LU 

CO 



!t> 



I 



O 

<: 



i 



00 



83 



: UJ £ 



CO 

i 

00 
LU 



ill 



LU a> LU 

« -M IK 
CO 

o 

•M ^ 3 
^ -M O 
O 3 >- 
-H <0 . 

CO loo 

to 4J LU 
^ 07 >- 
> CO » 



0 



LU 



eg 



48 



o 



EP 1 072 297 A1 



O 



SYSTEM: "WHAT IS YOUR FAVOTRITE FOOD. MASTER?" 
USER: "PEANUTS." 



FIG. 18 



SYSTEM: "ARE YOU REALLY THE MASTER? IS YOUR FAVORITE FOOD? 
USER: "PEANUTS." 

SYSTEM: "YOU ARE REALLY THE MASTER!" 



FIG. 19 



49 



o 



EP 1 072 297 A1 



O 



RECOGNITION DATA 
16A 



11A 




DA 

XlVOICE RECOGNITION 
^ MODULE 



WORD/PHRASE 
CLASSIFICATION 
HOOULE 



lie 



16tl 



CLASSIFI- 
CATION 
CODE 




CLASSIFICATION 
RULE 

RESPONSE 



PHYSICAL-CONDITION 
CHANGING MODULE 



PHYSICAL-CONDITION 
DATA 

16B 



IIB 




TIMER 



11D 



CHARACTER DATA 
16D 




EMOTION CHANGING 
MODULE 




.CURRENT-EMOTION 
DATA 

16C 




RESPONSE-SENTENCE 
CREATION MODULE 




KNOISLEDGE CONVERSATION PATTERN 
BASE HISTORY DATA 



naECTRONIC PET 
APPARATUS 



FIG. 20 



50 




51 



EP 1 072 297 A1 



ACTOR 


TYPE 


DESCRIPTION 


USER 


GREETING 


GOOD DAY 


SYSTEM 


GREETING 


HI 


USER 


QUERY 


HOW ARE YOU? 


SYSTEM 


STATE 


1 AM FINE 



16F 



FIG. 22 



ACTOR 


TYPE 


DESCRIPTION 


SYSTEM 


IMPRESSION 


BORING 


USER 


STATE 


1 AN HUNGRY 


SYSTEM 


GREETING 


GOOD DAY 


USER 


GREETING 


HI 



16F 



FIG. 23 



52 



EP1 072297A1 



lie 



RECOGNITION DATA 
16A 



TIMER 



DA, 

-U VO 



11A 



CE 



RECOGNITION 
NODULE 



RESPONSE 




PHYSICAL-CONDITION 
CHANGING MODULE 



PHYSICAL-CONDITION 
DATA 

166 



11B 



CHARACTER DATA 
t6D 




EMOTION CHANGING 
MODULE 



CURRENT-EMOTION 
DATA 

16C 



RESPONSE-SENTENCE 
CREATION MODULE 




KN0HLE06E CONVERSATION PAHERN DATA 
BASE HISTORY 



1 .ELECTRONIC PET APPARATUS 



FIG. 24 



53 



EP 1 072 297 A1 



CO 
UJ 



2g 



00 



5 



Of 
UJ 
CO 



o 

CM 
+ 

UJ 

a: 



o 

UJ 



O 
+ 



-I- 
Ol 

B 

00 

I 

as 

S5 

Li. 
T 



+ 



Ui Ml 

w Si 



00 

in 



a. 

ID 

00 

I 



o 



CN* 
4- 



00 
00 



00 



in 
I 

>^ 
o 

00 

to 



8S 
5 



C9 



5 5 



g 

cc g oc 
^ ^ 



CD 



3 2 5 



— o eg 
o <s < 



in 



54 



EP 1 072 297 A1 



160 







KEYWORD 






GOOD 


BAD 


HEY 


DIRH 


CURRY BREAD 




ANGER 


- 1 


+ 1 0 


+ 5 


+ 5 


+ 5 


11 


SADNESS 


+ 2 


+ 1 0 


+ 5 


+ 5 


+ 5 




JOY 


+ 20 


- 1 0 


-1 0 


- 1 5 


- 1 5 


frig 


FEAR 


-5 


+ 5 


+ 1 0 


-5 


-5 


it 


SURPRISE 


+ 5 


-1 


+ 1 0 


+ 5 


+ 5 






- 1 


+ 5 


+ 2 


+ 2 0 


+ 20 



FIG. 26 
160 







KEYWORD 






6000 


BAD 


HEY 


DIRTY 




ANGER 


- 1 


+ 1 0 


+ 5 


+ 4 


11 


SADNESS 


+ 2 


+ 1 0 


+ 5 


+ 4 




JOY 


+ 20 


- 1 0 


-1 0 


- 1 3 


fTlQ 


FEAR 


-5 


+ 5 


+ 1 0 


-4 




SURPRISE 


+ 5 


- 1 


+ 1 0 


+ 4 




HATRED 


- 1 


+ 5 


+ 2 


+ 1 6 



FIG. 27 



55 




56 



EP1 072 297 A1 



□ 




□ □□ 
m cEi [zi 



6 



9 



□ [!]□ 

□ □□ 




PORTABLf TELEPHONE 



FiG. 29 



57 



EP 1 072 297 A1 



nnVRNATIONAL SEARCH REPORT 



Imenidau) ftpptieatioa No. 

PCT/JP99/07271 



A. aASSmCATl(»( OF SUBJECT MATTER 

Int. CI A«3ria/00. A63713/I0, A63?I3/12, GiaU3/00. 
QaOL15/00« 010L17/00, O0m7/2Dr Q06P9/44 



B. FIELDS SBAlbCHBD 



ieaichri(dntifir>ikniyittnfeaarw<<byctmififito 
Int. CI A€3n3/00, A63FX3/10, A€aP13/12, 610L13/00, 
GIOLXS/OO, O10U7/00. 006717/20. a06F9/44 



Jltmiyo fihliwi Kdbo 1922-1996 
Xbkai JitmQv Sblnan KtabD 1971-2000 



iittlncliidadiBtefieldii 

Tcxrolcu Jlcsuyo fihixiaa Kbho 1994-2000 
JltBuyo Shisan Ttoaku Kdho 1996-2000 



C DOCiniBNTSaillSIDBBSDTOBBKSUVAirr 



CMegoiy* 



Xdwnl Id cUb No. 



J?, 10-276462, A (Canoo Inc.), 
13 October, 199B (X3.10.9B>, 
Pull text; all dr»%ring8 



Pull text; all drmvinga 
(Fanllyi nona) 



J7, 10-260976, A (Ricob Con^any, 
29 Saptmbar, 199B (29.09.98), 
Pull text I all drawings 



Full text; all drawinga 
(Family: none) 



Ltd.), 



22-25,28-31,34 
-37,40-43,45-4 
8,60-81,83-84, 
86-87,89-90,92 
-93 

1-21,26-27,32- 
33,38-39,44,49 
-79,82,85,88,9 
1,94-109 



22-25,28-31,34 
-37,40-43.45-4 
8,80-81,83-84, 
86-67,89-90,92 

-93, 
1-21,26-27,32- 
33,38-39,44,49 
-79,62^,88,9 



^ F«tbad9owMatoaiiiaBdlalh«i 



priMfiydtflcKliMfao«i£lkl«iAAi^PFiM«bQt«ilidM 



■tfdMdi 



dai 

V toii»BlvldibBqfafM4aiteflBpri«i9cbiD<i)Qrii^k 



ilflfii«da«taibi« 



■nbvcriteHMfMiani^ 



29 March, 2000 (29.03.00) 



11 April, 2000 (11.04.00) 



MsM Mdnailk^ addni of ISA/ 
Japaneae Patent Office 

FtevniloHo. 



T«bphoMKa 



Foim PayXSAy210 (IQC0B4 ibeoO (Joly 1992) 



58 



EP1 072 297 A1 



INTERNATIONAL SEARCH BKPORT 



PCT/JP99/07271 



I). DOOUMErmOONSDEREDTOBBRELBVAKr 



RdfifisUtucUSmNa 



Jp, xo-313357. A (HSC Corporation) « 
24 Hoveaber, 1998 (24.ll.9ah 
Pull text; all drawiogfl 



Poll text; all drawiaga 
(Pamll/: nooe) 

JP, 9-305787, A (Sharp Coxporacioa) , 

28 Voirenber, 1997 (28.11.97), 

Pall text; all dravingfl (Family: none) 



1« 94-109 



50-54,56-60,62 
66,66*72,74-7 

a 

1-49,55,61,67, 
73,79-109 

1*109 



FQanPCr/iaAaiO(caiidmittkii of second 1^ 1992) 



59 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
f documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 
M.OTHER: ^^-<^TVU1^ f- 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



